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Heisenberg's uncertainty relation for measurement noise and disturbance states that any position 
measurement with noise e brings the momentum disturbance not less than h/2e. This relation holds 
only for restricted class of measuring apparatuses. Here, Heisenberg's uncertainty relation is gener- 
alized to a relation that holds for all the possible quantum measurements, from which conditions are 
obtained for measuring apparatuses to satisfy Heisenberg's relation. In particular, every apparatus 
with the noise and the disturbance statistically independent from the measured object is proven to 
satisfy Heisenberg's relation. For this purpose, all the possible quantum measurements are char- 
acterized by naturally acceptable axioms. Then, a mathematical notion of the distance between 
probability operator valued measures and observables is introduced and the basic properties are 
explored. Based on this notion, the measurement noise and disturbance are naturally defined for 
any quantum measurements in a model independent formulation. Under this formulation, various 
uncertainty relations are also derived for apparatuses with independent noise, independent distur- 
bance, unbiased noise, and unbiased disturbance as well as noiseless apparatuses and nondisturbing 
apparatuses. Two models of position measurements are discussed to show that Heisenberg's relation 
can be violated even by approximately repeatable position measurements. 
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I. INTRODUCTION 

Quantum mechanics introduced the intrinsic discrete- 
ness of some physical quantities represented by polyno- 
mials of continuous quantities. This discrepancy was re- 
solved by the non-commutativity of the canonical ob- 
servables, the canonical commutation relation (CCR), 
found by Heisenberg. Another basic feature of quan- 
tum mechanics is that every measurement introduces an 
unavoidable and uncontrollable disturbance on the mea- 
sured object. Heisenberg's uncertainty relation interprets 
the physical content of the non-commutativity as the lim- 
itation to our ability of observation by quantifying the 
amount of unavoidable disturbance caused by measure- 
ment. 

According to the celebrated paper by Heisenberg [1] 
in 1927, Heisenberg's uncertainty relation can be formu- 
lated as follows: For every measurement of the position 
Q of a mass with root-mean-square error e{Q), the root- 
mean-square disturbance r](P) of the momentum P of the 
mass caused by the interaction of this measurement al- 
ways satisfies the relation 

eiQ)viP)>^, (1) 

where h is Planck's constant divided by 2tt. Here, we use 
the lower bound h/2 for the consistency with the modern 
treatment. 

Heisenberg [1] not only explained the physical intuition 
underlying the above relation by discussing the famous 
7 ray microscope thought experiment, but also claimed 
that this relation is a straightforward mathematical con- 
sequence of the CCR, QP — PQ = ih. Heisenberg's ar- 
gument runs as follows. He assumed that the mass state 
just after the measurement of position Q with root-mean- 



square error e(Q) is represented by a Gaussian wave func- 
tion tp with the spread Qi = e{Q). Then, by Fourier 
transform of 'i/'j he showed that the momentum spread 
Pi in this state satisfies the relation 

Qi^'i > I (2) 

He identified the momentum disturbance ri{P) with the 
momentum spread Pi in the state just after the mea- 
surement, and concluded Eq. (1) (see [2] for the detained 
discussion) . 

The mathematical part of his proof leading to Eq. (2) 
was refined by introducing the notion of standard de- 
viation shortly afterward by Kennard [3]. He explicitly 
defined the spreads Qi and Pi to be the standard devia- 
tions of position and momentum, a{Q) and cr(P). Using 
Fourier analysis, he proved 

a{Q)a{P) > I (3) 

in an arbitrary wave function ip. 

Kennard's relation above was later generalized to ar- 
bitrary pair of observables by Robertson [4] as follows. 
For any pair of observables A and B, their standard de- 
viations, cr[A) and cr(B), satisfy the relation 

a{A)a{B)>^\{^,[A,Bm (4) 

in any state ijj with a{A), cr{B) < oo. In the above, 
[A, B] stands for the commutator [A, B] = AB — BA, and 
the standard deviation is defined as a{A) ~ {{i/jjA^tp) — 
{tp, A^)'^y/^, where (• • • , • • • ) denotes the inner product; 
in this paper, every state vector is assumed normalized 
and the domain of the commutator [A, B] is considered 
extended appropriately. 
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Robertson proved the above relation using a simple ap- 
plication of the Schwarz inequality without using Fourier 
analysis. Thus, it was made clear that Heisenberg's re- 
lation Eq. (2) is a straightforward mathematical conse- 
quence of the CCR. However, Heisenberg's argument that 
leads to Eq. (1) from Eq. (2) has not been justified for 
more than 75 years since then. 

In fact, Heisenberg himself appears to have changed 
his position from 1927 to 1929. Aroimd this time, it was 
already known that an EPR type thought experiment vi- 
olates Eq. (1). In this case, by the effect of entanglement 
between two masses, 1 and 2, the position of mass 1 at a 
time t can be indirectly measured very precisely by mea- 
suring the entangled mass 2 without disturbing any ob- 
servables of mass 1, and hence if the momentum of mass 1 
is measured directly just after the position measurement, 
the momentum at the time t can also be measured very 
precisely. Heisenberg's response to this criticism appears 
that he considered the uncertainty relation to be Eq. (3) 
rather than Eq. (1). He stated, for instance, that "every 
experiment destroys some of the knowledge of the sys- 
tem which was obtained by previous experiments. This 
formulation makes it clear that the uncertainty relation 
does not refer to the past" (p. 20, Ref. [5]). Heisen- 
berg's response means that even if we can measure both 
the position and the momentum at the past time t very 
precisely, after the momentum measurement the mass no 
longer has definite position so that Eq. (3) is not violated 
at any time. 

Thus, in a few years after its first appearance, Heisen- 
berg's uncertainty relation might turn to be a more for- 
mal relation like the CCR than what Heisenberg claimed 
in 1927 as the fundamental limit on our ability of mea- 
surements. However, this change paid a high price of 
confusions among physicists as well broad scien- 

tific community. In fact, many text books have as- 
sociated the formal expression of "Heisenberg's uncer- 
tainty relation" to Eq. (3), but also associated the phys- 
ical meaning of "Heisenberg's uncertainty relation" to 
Eq. (1) by illustrating many examples including the 7 
ray microscope. Such a view has been accepted for many 
[6, 7, 8, 9, 10, 11, 12], whereas the universal validity of 
Eq. (1) has been also criticized in many ways [13, 14, 15, 
16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]. Thus, 
it is still a significant problem to clarify the limitation of 
Eq. (1) and to generalize it to obtain relations that hold 
for every measurement. 

Just as Heisenberg [1] argued that the mathematical 
relation Eq. (2) or Eq. (3) concludes the physical asser- 
tion expressed by Eq. (1), Robertson's relation Eq. (4) 
suggests the following physical assertion: // an appara- 
tus measures an observable A in a state ip with the root- 
mean-square error e{A), the apparatus disturbs an ob- 
servable B with the root-mean- square disturbance r]{B) 
satisfying the relation 

e{AUB)>^\{i^,[A,BM\. (5) 

We refer to the above relation as Heisenberg's noise- 



disturbance uncertainty relation. Many text books of 
quantum mechanics have regarded Robertson's relation 
Eq. (4) as the rigorous formalization of Heisenberg's 
noise-disturbance uncertainty relation, even though with- 
out claiming the universal validity of the latter. The pur- 
pose of the present paper is to clarify the limitation of 
Heisenberg's noise-disturbance uncertainty relation and 
to generalize this relation to a universally valid relations 
for the noise and the disturbance. 

It is not a much surprising fact that we have not ob- 
tained even a precise formulation of Heisenberg's noise- 
disturbance uncertainty relation. Robertson's relation is 
a universal statement for any states and any pair of ob- 
servables. However, Heisenberg's noise-disturbance un- 
certainty relation is a universal statement for any states, 
any pair of obscrvablcs, and, in addition to those, for 
any measurements or any measuring apparatuses. Since 
von Neumann's axiomatization of quantum mechanics [6] 
published in 1932, we have definite answers to questions 
as to what are general states and what are general ob- 
servables. However, the question was left unanswered for 
long time as to what are general measurements. 

Towards this problem, Davies and Lewis (DL) [30] in- 
troduced the mathematical formulation of the notion of 
"instrument" as normalized positive map valued mea- 
sures, to which we shall refer as DL instruments, and 
showed that this notion quite generally describes the sta- 
tistical properties of a general measurement, so that for 
any sequence of measurements the joint probability dis- 
tribution of those measurements are determined by their 
corresponding DL instruments. 

However, the question left open for some time as to 
whether every DL instrument corresponds to a possible 
measuring apparatus [31]. In order to solve this ques- 
tion, the present author [32, 33] introduced a general 
class of mathematical models of measuring processes and 
showed that the statistical properties given by any such 
model is described by a normalized completely positive 
map valued measure, to be referred to as a CP instru- 
ment, and conversely that any CP instrument arises in 
this way. Thus, we naturally conclude that measure- 
ments are represented by CP instruments, just as states 
are represented by density operators and observables are 
represented by self-adjoint operators. 

In this paper, we start with presenting the above char- 
acterization of measurements in more accessible form. In 
Section II, we propose the two axioms for general mea- 
suring apparatuses, the mixing law (of joint output prob- 
ability) and the extendability axiom, which characterize 
the statistical description of general measuring appara- 
tuses. In Section III, we introduce mathematical mod- 
els of measuring processes, called indirect measurement 
models, and pose the realizability postulate. Under the 
above three axioms, we show that (i) every apparatus cor- 
responds to a unique CP instrument that describes the 
statistical properties of that apparatus, (ii) conversely, 
every CP instrument corresponds to at least one appa- 
ratus, (iii) the correspondence is a one-to-one correspon- 
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dcncc up to statistical equivalence of apparatuses, and 
(iv) any apparatus has a statistically equivalent appara- 
tus which is described by an indirect measurement model. 
Thus, we establish the notion of "instrument" as the 
function of a measuring apparatus by the mathematical 
notion "CP instrument" that represents the statistical 
equivalence class of a measuring apparatus. In the above 
sections, we are also devoted to explain how the notion of 
CP instruments integrates such notions as effects, opera- 
tions, probability operator valued measures (POVMs), 
and trace- preserving completely positive maps, widely 
accepted in the field of quantum information [34]. It 
should be also pointed out that since in an indirect mea- 
surement model, the measuring interaction is described 
purely quantum mechanically, the above results provide 
a useful approach to explore statistical properties of gen- 
eral quantum measurements using quantum mechanical 
laws. 

In Section IV, a mathematical notion of the distance 
between POVMs and observables is introduced and the 
basic properties are explored. Then, we formulate the 
notion of measurement noise and obtain the basic prop- 
erties. In particular, we clarify the meaning of noise in 
the indirect measurement model and show that this no- 
tion is equivalent to the distance of the POVM of the ap- 
paratus from the observable to be measured, and hence 
the noise is independent of particular models but depend 
only on the POVM of the apparatus. In Section V, we 
formulate the notion of disturbance caused by a measure- 
ment and we obtain the basic properties. Disturbance is 
rather straightforward notion for indirect measurement 
models, while it is not clear whether it is model indepen- 
dent. We show that the disturbance in a given observable 
is determined only by the trace-preserving completely 
positive map that describes the nonselective operation 
of the apparatus. In Section VI, under the formulation 
provided as above, Heiscnberg's noise-disturbanco uncer- 
tainty relation is generalized to a relation that holds for 
any measuring apparatuses, from which conditions are 
obtained for measuring apparatuses to satisfy Heiscn- 
berg's relation. In particular, every apparatus with the 
noise and the disturbance statistically independent from 
the measured object is proven to satisfy Heiscnberg's re- 
lation. Under this formulation, various uncertainty rela- 
tions are also derived for apparatuses with independent 
noise, independent disturbance, unbiased noise, and un- 
biased disturbance as well as noiseless apparatuses and 
nondisturbing apparatuses. In Section VII, we examine 
von Neumann's model of position measurement to show 
that this model typically satisfies Heiscnberg's relation. 
Then, we examine the position measurement model that 
was introduced in Ref. [19] and show that this model vi- 
olates Heiscnberg's relation uniformly. The above model 
was shown in Ref. [19] to realize Yuen's contractive state 
measurement [16] and to break the standard quantum 
limit for monitoring free-mass position claimed by Bra- 
ginsky and collaborators [10, 11, 12] as a consequence 
of Heiscnberg's relation. An experimental proposal was 



given in Ref. [35] for realizing the above model in an 
equivalent linear optical setting. In Section VIII, based 
on the above model we show that Heiscnberg's relation 
can be violated even by approximately repeatable posi- 
tion measurements. Some discussions in the final section 
conclude the present paper. 



II. STATISTICS OF GENERAL QUANTUM 
MEASUREMENTS 

A. Postulates for quantum mechanics 

Throughout this paper, we assume the following postu- 
lates introduced by von Neumann [6] for non-relativistic 
quantum mechanics without any superselection rules. 

Postulate I. (Representations of states and observ- 
ables) Any quantum system S is associated with a unique 
separable Hilbert space Hs, called the state space of S. 
Any state of S is represented in one-to-one correspon- 
dence by a positive operator p with unit trace, called a 
density operator on Tig- Under a fixed unit system, any 
observable of S is represented in one-to-one correspon- 
dence by a self-adjoint operator A (densely defined) on 
Hs- 

Postulate II. (Schrodinger equation) If system S is iso- 
lated in a time interval {t,t'), there is a unitary operator 
U, called the time evolution operator, such that if S is in 
state p at time t then S is in state p' = UpW at time t'. 

Postulate III. (Born statistical formula) Any observ- 
able A can be precisely measured in any state p in such a 
way that A takes the value in a Borel set A with probabil- 
ity Tt[E^{A)p], where E^{A) is the spectral projection 
of A corresponding to Borel set A. 

Postulate IV. (Composition rule) The state space of 
the composite system S -I- S' of two systems S and S' is 
the tensor product Hs <8' Hs' of their state spaces. An 
observable A in S and an observable B in S' are identified 
with the observables A^I and I^B, respectively, in the 
system S -I- S'. 

For any unit vector tp, the state p = [■(/')(V'I is called a 
vector state represented by ip. In this case, tp is called a 
state vector representing the state p. 

Let pi, . . . ,pn be a sequence of density operators and 
let pi, . . . ,p„ be a probability distribution on {1, . . . , n}, 
i.e., pj > for all j and pj = 1. We say that a system 
S is a random sample from the ensemble {{pj}, {Pj}), iff 
system S is in state pj with probability pj. In this case, 
an observable A takes, in a precise measurement, the 
value in a Borel set A with probability 

P(A)=^p,Tr[ii;^(A)p,]. (6) 

i 

Let p = ^Ylij PjPj- By linearity of trace, the density oper- 
ator p satisfies 

P(A) = Tr[i?^(A)p]. (7) 
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Since A and A arc arbitrary, from Postulate III (Born 
statistical formula) we conclude the following. 

Theorem II. 1 Any random, sample from ensemble 

({ft}: {Pj}) «s in- the state p = J^jP^Pj- 

The above theorem has established the interpretation 
of the mixture of states that the system S is in the state 
ppi + (1 — p)/92, if it is in state pi with probability p and 
in state p2 with probability I — p. 

The notion of precise measurements of observables is 
determined solely by Postulate III (Born statistical for- 
mula) without assuming any further conditions on the 
state change caused by measurement such as the projec- 
tion postulate stating that the measurement projects the 
state onto the eigenspace corresponding to the eigenvalue 
to be obtained. 

Postulate III (Born statistical formula) does not as- 
sume that the observable has a certain unknown value in 
the state just before the measurement that is reproduced 
by a precise measurement, but only requires that the pre- 
cise measurement statistically reproduces the postulated 
probability. 

A Hilbert space is separable if and only if its dimension 
is at most countable infinite. Throughout this paper, 
only separable Hilbert spaces are considered and simply 
called Hilbert spaces. 

Throughout this paper, the word "Borel set" can be 
safely replaced by the word "interval" only with some 
modifications on mathematical technicality. Readers not 
familiar with measure theory are recommended to read 
with such replacements. 

The relation between the present formulation based on 
spectral projections due to von Neumann [6] and Dirac's 
formulation [36] is as follows. If the observable A has the 
Dirac type spectral decomposition 

A = ^^^|/x,z.)(/i,:/| + / ^A|A,z/)(A,i/|dA, 

where \x varies over the discrete eigenvalues, A varies over 
the continuous eigenvalues, and v is the degeneracy pa- 
rameter, then we have 

i;^(A) = / ^|A,!/)(A,HdA. 

In this case, we have 
Tr[i;^(A)p] 

Wc do not allow unnormalizable states such as the 
one described by Dirac's delta function, since they by 
no means define the normalized probability distribution 
of the output of every measurement consistent with the 
probability theory axiomatized by Kolmogorov [37]. 



B. Output probability distributions 

Every measuring apparatus has a macroscopic output 
variable that takes the output of each instance of mea- 
surement. The output variable is a random variable, in 
the sense of classical probability theory [37] , the probabil- 
ity distribution of which depends only on the input state, 
the state of the system to be measured at the instant just 
before the measurement. 

Let S be a quantum system, to be referred to the object, 
with state space H. Let A(x) be a measuring apparatus 
with output variable x to measure the object S. We as- 
sume that X takes values in the real line R. For any Borel 
set A in R, we shall denote by "x e A" the probabilistic 
event that the output x takes a value in A. The event 
"x e A" is called the outcome of measurement. The 
probability distribution of x on input state p is denoted 
by Pr{x S A||p}, where A varies over all Borel subsets of 
the real line, and called the output probability distribution 
of A(x). We shall write Pr{x e A||p} = Pr{x e A||^}, 
ifp= 

In this paper, any probability distribution is required 
to satisfy the positivity, the countable additivity, and the 
normalization condition [37], so that the output proba- 
bility distribution satisfies the following conditions. 

(i) (Positivity) Pr{x e Ajj/a} > for any Borel set A. 

(ii) (Countable additivity) 

Pr{xeA||p} = 5^Pr{xeA,||p} (8) 

j 

for any disjoint sequence of Borel sets Ai, A2, . . . with 
A = U,A,. 

(iii) (Normalization condition) Pr{x G R||/9} = 1- 

In addition to the above, it is natural to require that 
the output probability distribution should satisfy the fol- 
lowing postulate. 

Mixing law of output probability: For any appa- 
ratus A(x), the function p 1— > Pr{x e A||/9} is an affine 
function of density operators p for every Borel set A, i.e., 

Pr{xe A||ppi + (l-p)p2} 
= pPr{x e A||pi} + {l-p) Pr{x e A||p2}, (9) 

where pi and p2 are density operators and < p < 1. 

The above postulate is justified as follows. If the 
system S is a random sample from the ensemble 
({pi) P2}j {p, 1 ^p}), then the event "x G A" occurs with 
probability p Pr{x e A||pi} + {1 - p) Pr{x G A||p2}- On 
the other hand, from Theorem II. 1 in this case the sys- 
tem S is in the state ppi -|- (1 — p)p2, so that the above 
equality should hold. 

C. Probability operator valued measures 

In order to characterize the output probability distri- 
butions, we need a mathematical definition: A mapping 
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n : A n(A) of the collection B{IV) of Borcl subsets 
in R into the space C{7i) of bounded operators on Ti is 
called a probability operator valued measure (POVM), if 
the following conditions are satisfied: 

(i) (Positivity) n(A) > for all A e B(R). 

(ii) (Countable additivity) For any disjoint sequence 
Ai, A2, . . . of Borel sets with A = [J^ Aj, we have 

n(A) = ^n(A,), (10) 

j 

where the summation is convergent in the weak operator 
topology, i.e., we have {tp\U{A)\tp) = Ej(V-'|n(Aj)|i/;) for 
every state vector i/i. 

(iii) (Normalization condition) n(R) = /, where / is 
the identity operator on H. 

For mathematical properties of POVMs we refer to 
Berberian [38] . One of important consequences from the 
mixing law of output probability is the following char- 
acterization of output probability distributions given in 
Ref. [39]. 

Theorem II. 2 The mixing law of output probability is 
equivalent to the following requirement: 

For any apparatus A(x) there exists a unique POVM 
n satisfying 

Pr{x e A||p} = Tr[n(A)p] (11) 

for any Borel set A and density operator p. 

A sketch of the proof runs as follows. It is easy to 
check that Eq. (11) defines the output probability dis- 
tribution satisfying the mixing law of output probability. 
Conversely, suppose that the output probability distribu- 
tion satisfies the mixing law of output probability. Recall 
that every trace class operator p can be decomposed as 

4 

P = Yl,^oPj^ (12) 

i=i 

where pi, . . . ,p4 are density operators and ai, . . . ,04 are 
complex numbers; one of the decompositions can be eas- 
ily found from the spectral decomposition. By the above 
decomposition, for every trace class operator p and every 
Borel set A, we can define a complex number n(A, p) by 

4 

n(A,p)=^a, Pr{xe A||p,}. (13) 

The mixing law of output probability ensures that for 
every decompositions of the same p, the above equation 
defines the unique value, and moreover that the function 
n is linear in p. Let |1), |2), . . . be an orthonormal basis 
of H. Then, we can define an operator n(A) by 

n(A)=^n(A,|m)(n|)|n)(m|. (14) 

n,m 



Then, by linearity of n(A, p) in p, we have 

IV[n(A)p] = ^n(A,|m)(n|)(m|p|n) 

= ulA,J2{m\p\n)\m){n\\ 

= n(A,p) 

= Pr{x e A\\p}. (15) 

Thus, n(A) is a unique operator satisfying Eq. (11). 
Now, conditions (i) (iii) for II follow easily from Eq. (11), 
and hence II is a POVM. This completes the proof. 

The POVM n defined by Eq. (11) is called the POVM 
of apparatus A(x). The operator n(A) is called the effect 
of apparatus A(x) associated with the outcome x G A. 
For the general notion of effects, we refer to Kraus [40]. 
For applications of POVMs to quantum measurement, 
quantum estimation, and quantum information, we re- 
fer the reader to Helstrom [41], Davies [42], Holevo [43], 
Peres [44], and Nielsen-Chuang [34]. 

Let A be an observable of system S. Postulate III 
(Born statistical formula) naturally leads to the following 
definition. We say that apparatus A(x) satisfies the Born 
statistical formula (BSF) for observable A on input state 
p, if we have 

Pr{x e A||p} = Ty[E^{A)p] (16) 

for every Borel set A. The mapping E"^ that maps every 

Borel set A to the spectral projection E^{A) of A cor- 
responding to A is called the the spectral measure of A. 
For mathematical theory of spectral measures, we refer 
to Halmos [45] . From Postulate III (Born statistical for- 
mula), apparatus A(x) precisely measures an observable 
A if and only if A(x) satisfies the BSF for observable 
A on every input state, and moreover for every observ- 
able A of S there is at least one apparatus that precisely 
measures A. From Eqs. (11) and (16), apparatus A(x) 
precisely measures observable A if and only if the POVM 
n o/A(x) is the spectral measure E^, i.e., 

11 = E^. (17) 



D. Quantum state reductions 

We have shown that every apparatus is associated with 
a POVM which determines the output probability distri- 
bution. However, POVMs of apparatuses do not deter- 
mine the joint probability distributions of outputs from 
successive measurements using several apparatuses. In 
the following, we introduce the notion of quantum state 
reduction to determine such joint probability distribu- 
tions. 

Depending on the input state p and the outcome x G 
A, let /C{xeA} be the state just after the measurement 
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conditional upon the outcome x G A. We assume that 
for any Borel set A with Pr{x e A Hp} > the state 
P{xeA} is uniquely determined. If Pr{x G A||p} = 0, the 
state PjxeA} is taken to be indefinite and the notation 
P{xeA} denotes an arbitrary state. The state P{xeA} is 
called the output state given the outcome x e A on input 
state p. 

The state change from the input state to the out- 
put state is generally called the quantum state reduc- 
tion; while the transformation from the input state to the 
output probability distribution, namely the state of the 
macroscopic meter, is called the objectification or the ob- 
jective state reduction. Those two different notions have 
been mixed up for long time [46]. 

Two apparatuses are called statistically equivalent, if 
they have the same objective state reduction and quan- 
tum state reduction, or they have the same output prob- 
abilities and the same output states for any outcomes 
and any input states. 

E. Mixing law 

For notational convention, we distinguish apparatuses 
by their output variables. For instance, symbols A(x), 
A(y), and A(z) denote three apparatuses with output 
variables x, y, and z, respectively. 

The operational moaning of the state /0{xgA} is given 
as follows. Suppose that a measurement using the appa- 
ratus A(x) on input state p is immediately followed by a 
measurement using another apparatus A(y). Then, the 
joint probability distribution Pr{x G A, y g A'Hp} of the 
output variables x and y is given by 

Pr{xG A,yG A'IIp} 

= Pr{y e A'||p{xeA}} Pr{x G A||p}, (18) 

since the event x G A occurs with probability Pr{x G 

A||p} and then the event y G A' occurs with probability 
Pr{y G A'||p{xeA}}- We shall call the above joint proba- 
bility distribution the joint output probability distribution 
of A(x) and A(y). 

Thus, the joint probability distribution of outputs of 
successive measurements depends only on the input state 
of the first measurement and should satisfy the following 
postulate. 

Mixing law (of joint output probability): For 

any apparatuses A(x) and A(y), the function p i-^ 
Pr{x G A,y G A'||p} is an affine function of density 
operators p for every pair of Borel sets A, A', i.e., 

Pr{x G A, y G ^'\\ppi + (1 - p)p2} 
= pPr{xG A,yG A'IIpi} 

+ (l-p)Pr{xG A,yG A'||p2}, (19) 

where p\ and pi are density operators and < p < 1 . 
This requirement is justified as follows. The successive 

applications of two apparatuses A(x) and A(y) to a sin- 
gle system S can be considered as an application of one 



apparatus A(x, y) with two output variables x and y. 
Thus, the above postulate follows from the mixing law of 
output probability (generalized to apparatuses with two 
output variables). 

By substituting A' = R in Eq. (18) and using the 
normalization condition Pr{y G R||p{xeA}} = 1) we have 

Pr{x G A, y G R||p} = Pr{x G A||p} (20) 

for any A and p. Thus, we conclude that the mixing law 
of joint output probability implies the mixing law of out- 
put probability. From now on, the mixing law of joint 
output probability will be simply referred to as the mix- 
ing law. 

Consider the case where A = R. The symbol p in 
Pr{x G R, y G A'll/?} refers to the state just before A(x) 
measurement, while the symbol p in Pr{y G A'||/!)} refers 
to the state just before A(y) measurement. Thus, the 
above two probabilities arc not generally identical. Ac- 
cording to Eq. (18), we have 

Pr{xGR,yG A'llp} = Pr{y G A'||p{xeR}}. (21) 

The above relation characterizes the state P{xeR}- 

If Pr{x G A\\p} = 1, by the additivity of probability, 
we have 

Pr{xG A,y G A'IIp} = Pr{x G R, y G A'||p}, (22) 
and hence Eq. (18) leads to 

Pr{yG A'||p{xeA}} = Pr{y G A'Hp^xeR}}- (23) 

Since apparatus A(y) is arbitrary, we have P{xeA} = 
P{xGR}- Thus, the condition x G A makes no selection. 
In this case, the state change p i— > P{xeA} is called the 
nonselective state change. 

From Eq. (18), the conditional probability distribution 
of y given x G A is determined as 

Pr{y G A'|x G A||p} = Pr{y G A'||p{xeA}}. (24) 

provided that Pr{x G A||/9} > 0. In particular, we have 

Pr{y G A'|x G A|ip} = Tr[i?^(A')p{xeA}], (25) 

if A(y) precisely measures an observable B. The above 
relation uniquely determines the output state P{xeA}- 

F. Operational distributions 

In 1970, Davies and Lewis [30] introduced the following 
mathematical notion for unified description of statistical 
properties of measurements. A mapping I : A i— » X(A) 
of 6(R) into the space C{tc{TL)) of bounded linear trans- 
formations on the space tc{'H) of trace-class operators on 
Ti, is called an DL instrument, iff the following conditions 
are satisfied. 

(i) (Positivity) X(A) is a positive linear transformation 
of Tc{n) for every A G B{K). 
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(ii) (Countable additivity) For any disjoint sequence 
Ai, A2, . . . of Borel sets with A = |J^. Aj, we have 

j(A) = ;^j(A,-), 

j 

where the summation is convergent in the strong op- 
erator topology of £(rc(W)), i.e., lim„_>oo ||2^(A)p — 
X;"^i2:(Aj)/9||i-c = for any p G tc{H), where \\X\\rc = 

TrVXtX for any X € tc(H). 

(iii) (Normalization condition) 2r(R) is trace- 
preserving, i.e., 

Tr[J(R)] = Trp (26) 

for any p e tc{H). 

For mathematical properties of DL instruments we re- 
fer to Davies [42]. One of important consequence from 
the mixing law is the following unified characterization 
of output probability distribution and quantum state re- 
duction given in Ref. [47, 48]. 

Theorem II. 3 The mixing law is equivalent to the fol- 
lowing requirement: 

For any apparatus A(x) there exists a unique DL in- 
strument! satisfying 

I{A)p = Pr{x e A||p}p{,,gA} (27) 

for any Borel set A and density operator p. 

A sketch of the proof runs as follows. Let A(x) be an 
apparatus. For any state p and Borel set A, we define an 
operator X(A, p) by 

J(A, p) = Pr{x e A||p}p{,eA}. (28) 

If Pr{x e All/?} = 0, then J(A, p) = 0, so that X(A,p) 
is determined definitely, despite that jO{xGA} is indefinite 
in this case. Then, for any apparatus A(y) to precisely 
measure an observable B, we have 

Pr{xe A,y e A'IIp} 

= Tr[i;^(A')p{xeA}]Pr{xeA||p} 
= Tr[E^{A')I{A,p)]. 

Thus, by the mixing law, we have 

J(A,ppi + (l-p)p2) 

= pJ(A,pi) + (l-p)J(A,p2), (29) 

where pi and p^ are density operators and < p < 1. 
Thus, the definition of T{A, p) can be extended to all 
trace class operators p by the relation 

4 

^(A,/>) = J2ajI{A,pj), (30) 

for any density operators pi , . . . , p4 and complex num- 
bers ai,...,a4 such that p = 'Ylfj=i^3P3- Since every 



density operator has at least one such decomposition, 
and since Eq. (29) ensures the uniqueness of extension, 
the operator X(A, p) is well-defined for all Borel sets A 
and all trace class operators p. Then, we can see that 
the mapping that maps p to I(A, p) is a bounded linear 
transformation of Tc{Ti) for every Borel set A. We denote 
this mapping by 1(A). Then, we define T as the mapping 
that maps A to 2r(A). Now, we have only to show three 
properties (i)-(iii) above; however, these arc easy con- 
sequences from the positivity, countable additivity, and 
normalization condition of the probability distribution 
Pr{x e A\\p}. For the detail, sec Rcfs. [47, 48, 49]. 

The mapping X(A) given above is called the operation 
of apparatus A(x) associated with the outcome x e A. 
The mapping X is called the operational distribution of 
apparatus A(x). Then, the output probability and the 
output state can be expressed by 

Pr{x e A\\p} = Tr[J(A)p], (31) 
- Tr[J(A)p]' 

where the second relation assumes Pr{x G A||p} > 0. 
Thus, if 2x and Jy are the operational distributions of 
A(x) and A(y), respectively, then the joint output prob- 
ability distribution can be expressed by 

Pr{x e A, y e A'llp} = Tr[Jy(A')Xx(A)p] (33) 

for any state p and any Borel sets Ai, A2. 

Both the output probability distribution and the out- 
put states are determined by the operational distribution. 
Thus, two apparatuses are statistically equivalent if and 
only if they have the same operational distribution. 

Let us consider three apparatuses A(x), A(y), A(z) 
with operational distributions Xx,2y,Xz, respectively, 
and suppose that in a state p, these three apparatuses 
are applied to the system S successively in this order. 
Then, the joint probability distribution of three outputs 
X, y, z are given by 

Pr{xe A,ye A',ze A"||p} 

= Pr{y e A', z e A"||p{xeA}} Pr{x G A||p} 
= Tr[X,(A")Xy(A')p{xeA}]Pr{xe A||p} 
= Tr[X,(A")Jy(A')Jx(A)p]. 

Thus, by mathematical induction, we obtain the relation 

Pr{xi G Ai,...,x„ e A.a\\p} 

= Tr[X„(A„)...Xi(Ai)p] (34) 

for the joint probability distribution of the output vari- 
ables xi , . . . , x„ of the successive measurements on the 
initial input state p using apparatuses A(xi), . . . , A(x„) 
in this order with operational distributions Ii,...,2'„, 
respectively. Thus, joint probability distribution of the 
output variables in any successive measurements are de- 
termined by the operational distributions of apparatuses. 
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so that statistically equivalent apparatuses are mutually 
exchangeable without affecting the joint probability of 
their outcomes. 

In this subsection, under the mixing law, we have 
shown that statistical properties of every apparatus are 
described by a DL instrument and that two apparatuses 
are statistically equivalent if and only if they corresponds 
to the same DL instrument. 



G. Duality 

For any bounded linear transformation T on TciTi), 
the dual of T is defined to be the bounded linear trans- 
formation T* on CiJ-L) satisfying 

Tv[A{Tp)] = Tt[(TM)p] (35) 

for any A e £{71) and p G tc{'H). The dual J(A)* of the 
operation 1(A) is called the dual operation associated 
with X e A; by Eq. (35) it is defined by the relation 

Tr[^(A)p]=Tr{[J(A)*A]p} (36) 

for any A e C{H) and p G tc{H). 

The operator I{A)*I obtained by applying the dual 
operation 1(A)* to the identity operator is called the 
effect of operation X(A). By Eq. (31) and Eq. (35) we 
have 

Pr{xG A||p} = Tr{[I(A)*/]p}. (37) 

Since p is arbitrary, comparing with Eq. (11), we have 

n(A)=X(A)*/ (38) 

for any Borel set A. Thus, the POVM of A(x) is deter- 
mined by the effects of the operational distribution I. 

Let 2x and Ty be the operational distributions of A(x) 
and A(y), respectively, and let Ily be the POVM of 
A(y). Then, we have 

Tr[2-y(A')Jx(A)p] = Tr{[Jy(A')*/][Tx(A)p]} 
= Tr{[ny{A')[MA)p]} 
= Tr{[J(A)*ny(A')]p}. (39) 

Thus, the joint output probability distribution can be 

expressed by 

Pr{x e A, y G A'llp} = Tr{[I(A)*ny(A')]p} (40) 

for any A, A' G ^(R). 

Given the operational distribution I of an apparatus 
A(x), the operation T = X(R) is called the nonselective 
operation of apparatus A(x) and T* = I(R)* is called 
the nonselective dual operation of apparatus A(x). The 
nonselective operation T is trace-preserving, i.e., 

Tr(Tp) = Trp (41) 

for any trace-class operator p, while the nonselective dual 
operation T* is unit-preserving, i.e., 

T*I = I. (42) 



H. Individual quantum state reductions 

It is natural to assume that the output variable x can 
be road out with arbitrary precision. It follows that each 
instance of measurement has the output value x = x. 
Let P{x=x} be the state of the system S at the time just 
after the measurement on input state p provided that the 
measurement yields the output value x = x. The indi- 
vidual quantum state reduction caused by the apparatus 
A(x) is the state change p i— > /9{x=x} for any real number 
X. The state p{x=x} is called the output state given the 
output X = X on input state p. 

For distinction, we shall call the previously defined 
quantum state reduction p i— > /0{xeA} as the collective 
quantum state reduction. 

If Pr{x G {a;} Hp} > 0, the state P{x=a;} is determined 
by the relation 

P{x=x} = P{xe{:r}}- (43) 

However, the above relation determines no P{x=a;}) if tli6 
output probability is continuously distributed. In order 
to determine states P[x=x}j the following mathematical 
notion was introduced in Ref. [50]. A family {p{x=a;}| x G 
R} of states is called a family of posterior states for a 
DL instrument T and a prior state p, if it satisfies the 
following conditions. 

(i) The function x i-^ P{x=x} is Borel measurable. 

(ii) For any Borel set A, we have 

I{A)p= [ p[^^,yTr[dIix)p]. (44) 

J A 

It was shown in Ref. [50] that for any DL instrument I 
and prior state p, there exists a family of posterior states 
uniquely, where two families are taken identical, if they 
differ only on a set A such that Tr[I(A)p] = 0. 

We define the individual quantum state reduction to be 
the correspondence from the input state p to the family 
{P{x=x}| x G R} of posterior states for the operational 
distribution I of A(x) and prior state p. 

According to the above definition, the individual quan- 
tum state reduction and the collective quantum state re- 
duction are related by 

" Pr{x g A||p} ^^"=^> ^""^"^ ^ ^^^^ 

Thus, the individual quantum state reduction and the 
collective quantum state reduction are equivalent under 
Eq. (45). 

The operational meaning of the individual quantum 
state reduction is given as follows. Suppose that a mea- 
surement using the apparatus A(x) on input state p is 
immediately followed by a measiircmcnt using another 
apparatus A(y). Then, the joint probability distribution 
Pr{x G A,y G A'||/9} of the output variables x and y 
is given by Eq. (18). The conditional probability distri- 
bution of y given x G A is defined in probability theory 
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by 



Pr{y e A'|x G A} 



Pr{x € A,y e A^IIpI 
Pr{x e A||p} ' 



(46) 



However, this definition does not cover the conditional 
probabihty distribution of y given x = x, since it may 
happen that Pr{x G = for every x. To avoid 

this difficulty, in probability theory the conditional prob- 
ability distribution of y given x = a; is defined as the 
function x Pr{y S A'|x = x\\p} satisfying 

Pr{xe A,ye A'IIp} 

= j Pr{ye A'l x = x||/9}Pr{xeda;|lp}. (47) 

From Eqs. (45) and (46), we have the following charac- 
terization of the individual quantum state reduction, 

Pr{y e A'|x = x\\p} = Pr{y e A'||p{^=,}}. (48) 

Thus, the individual quantum state reduction is deter- 
mined by the conditional probability distribution of the 
output of any succeeding measurement conditional upon 
the individual output. 



I. Extendability postulate 

In the previous discussions, under a sole hypothesis, 
the mixing law, we have shown that statistical proper- 
ties of every apparatus are described by a DL instrument 
and that two apparatuses arc statistically equivalent if 
and only if they corresponds to the same DL instrument. 
Consequently, the set of statistical equivalence classes of 
apparatuses are considered to be a subset of the set of DL 
instruments. In the above sense, "apparatus" denotes a 
physical system for measurement and "DL instrument" 
is intended to denote the function of an apparatus or to 
mathematically denote the statistical equivalence class of 
an apparatus; of course, we consider that two apparatuses 
have the same function if and only if they are statistically 
equivalent. However, up to this point, some DL instru- 
ments represent statistical equivalence classes of appara- 
tuses, but some of them may not. In this subsection, we 
shall eliminate physically irrelevant DL instruments by 
another physically plausible requirement. 

Wc deal with any apparatus A(x) as a mathematical 
description of a physical system which has a macroscopic 
variable x to measure a quantum system S. However, 
even if we sufficiently specify the physical entity of the 
measuring apparatus described by A(x), an ambiguity 
still remains as to what is the system to be measured. 
For example, let S' be another quantum system which is 
remote from both S and A(x). Then, wc always make the 
composite system S-l-S'. Since we identify an observable 
^ of S with the observable ^4 (g) / of S -|- S', the physical 
apparatus measuring an observable A of the system S is 
also considered as the one measuring the observable A(^I 



of the extended system S + S'. Thus, every real appara- 
tus has the property that if it is described to measure a 
system S, then it is also described to measure the triv- 
ially extended system S-l-S'. The above consideration 
naturally leads to the following postulate. 

Extendability postulate: For any apparatus A(x) 
measuring a system S and any quantum system S' not 
interacting with A(x) nor S, there exists an apparatus 
A(x') measuring system S-l-S' with the following statis- 
tical properties: 

Pr{x' e A||p®p'} = Pr{xGA|jp}, (49) 

(p «) p'){x'eA} = P{x6A}«ip' (50) 

for any Borel set A, any state p of S, and any state p' 
ofS'. 

In order to obtain a mathematical condition charac- 
terizing the models satisfying the above requirement, we 
need the following mathematical notions. Let C" be the 
Hilbert space of n-dimensional vectors. Since every linear 
operator on a finite dimensional space is bounded and of 
finite trace, we have tc(C") = £(C"). Then, the space of 
trace class operators on the tensor product Hilbert space 
7^ ® C" is decomposed as tc{H ® C") = tc{H) ® £(C"). 
Thus, any linear transformation T on rc{H) can be ex- 
tended naturally to the linear transformation T idn on 
Tc{H O C") by 

(T ® idr,){J2 Po ® Pi) = E '^^Pi) ® P'j (51) 

j j 

for any pj G tc(H) and p'^ G £(C"). Then, T is called 
completely positive ( CP), if T (g) idn maps positive oper- 
ators in Tc{'H ® C") to positive operators in tc{'H ® C") 
for any positive integer n. A DL instrument X is called 
a completely positive ( CP) instrument, if T{A) is CP for 
every A. Completely positive maps on C*-algebras were 
introduced by Stinespring [51], CP operations were in- 
troduced by Kraus [52] , and CP instruments were intro- 
duced in Ref. [32]. For the general theory of CP maps 
we refer to Takesaki [53] . 

Theorem II. 4 Under the mixing law, the extendability 

postulate implies the following requirement: 

The operational distribution of every apparatus should 
be a CP instrument. 

The proof runs as follows. Let A(x), S, and S' be those 
given in the extendability postulate. The state space of S 
is denoted by H. and the state space of S' is supposed to 
be C". It has been proven that under the mixing law, ev- 
ery apparatus has its own operational distribution. Let I 
be the operational distribution of A(x). By the extend- 
ability postulate, there is an apparatus A(x') satisfying 
Eqs. (49) and (50). The mixing law ensures the existence 
of the operational distribution I' of the apparatus A(x'). 
Then, we have 

I'{A){p®p') = Pr{x'G A||p®p'}(p(gp'){,,eA} 
= Pr{xG A||p}p{xeA}®p' 
= [I{A)p]®p'. (52) 
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It follows that the operation X'( A) of the extended appa- 
ratus A(x') associated with A is represented by X'(A) = 
X(A) <S> idn- Then, by the positivity of operation X'(A), 
X(A) (g) idn should be a positive linear transformation on 
Tc{H (g) C"). Since n is arbitrary, we conclude that X(A) 
is completely positive, so that the proof is completed. 

The transpose operation of matrices in a fix basis is a 
typical example of a positive linear map which is not CP 
[34]. Lot T be a transpose operation on Tc(7i) = C{Ti.) 
for H = C", and let /z be any probability measure on R. 
Then, the relation 

I{A)p = ^i{A)T{p) (53) 

for any Borel set A and any operator p defines a DL 
instrument. However, since T is not CP, the operation 

X(A) is not CP, so the T is not a CP instrument. The 
extendability postulate implies that there is no apparatus 
corresponding to the above DL instrument. 

As a consequence of the whole argument of this section, 
we have reached the following conclusion: 

For any apparatus A(x) there exists a unique CP in- 
strument X such that the output probability distribution 
and the quantum state reduction are described by 

Pr{xeA||p} = Tr[X(AV], (54) 

p^pi-^^y = ^^m ^^^^ 

for any Borel set A and any input state p, where the 
second equality assum,cs Tr[!r(A)/C)] > 0. 

We have posed two plausible requirements, the mixing 
law and the extendability postulate, as a set of neces- 
sary conditions for every apparatus to satisfy. Under 
these conditions, we have shown that every apparatus 
corresponds uniquely to a CP instrument, called the op- 
erational distribution, that determines the output prob- 
ability distributions and the quantum state reduction. 
Thus, the problem of determining all the possible quan- 
tum measurements is reduced to the problem as to which 
CP instrument corresponds to an apparatus. This prob- 
lem will be discussed in the next section and it will be 
shown that every CP instrument corresponds to at least 
one apparatus. Thus, the statistical equivalence classes 
of all the possible measuring apparatuses are described 
in one-to-one correspondence by the CP instruments. 



III. MEASURING PROCESSES 

A. Indirect meeisurement models 

The disturbance on the object caused by a measure- 
ment can be attributed to an interaction, called the m,ea- 
suring interaction, between the object and the appara- 
tus. In this section, we shall consider indirect measure- 
ment models in which the measuring interactions are sub- 
ject to the equations of motions in quantum mechanics 



[47, 54] and show that even though the indirect mea- 
surement models are only a subclass of all the possible 
quantum measurements, every measurement is statisti- 
cally equivalent to one of indirect measurement models. 

Let A(x) be a measuring apparatus with macroscopic 
output variable x to measure the object S. The measur- 
ing interaction turns on at time t, the time of measure- 
ment, and turns off at time t-\- At between object S and 
apparatus A(x). We assume that the object and the ap- 
paratus do not interact each other before t nor after t-\- At 
and that the composite system S-I-A(x) is isolated in the 
time interval {t,t + At). The probe P is defined to be the 
minimal part of apparatus A(x) such that the composite 
system S -|- P is isolated in the time interval {t,t -\- At). 
By minimality, we naturally assume that probe P is a 
quantum system represented by a Hilbert space K,. De- 
note by U the unitary operator on Tig) /C representing the 
time evolution of S + P for the time interval (t, t + At). 

At the time of measurement the object is supposed to 
be in an arbitrary input state p and the probe is supposed 
to be prepared in a fixed state a. Thus, the composite 
system S -I- P is in the state p ® cr at time t and in the 
state U [p® (t)U'^ at time t + At. Just after the measuring 
interaction, the object is separated from the apparatus, 
and the probe is subjected to a local interaction with the 
subsequent stages of the apparatus. The last process is 
assumed to measure an observable M, called the probe 
observable, of the probe, and the output is represented 
by the value of the output variable x. The above mea- 
surement of M is assumed to be local, in the sense that 
the measuring apparatus for the measurement of M in- 
teracts with the probe P but does not interact with the 
system S [54]. 

The measuring process of the apparatus A(x) de- 
scribed above is thus modeled by the state space JC of 
the probe P, the initial state cr of P, the time evolution 
operator /7 of S -I- P, and the probe observable M. 

In order to develop the theory of measuring pro- 
cesses described above, we define an indirect measure- 
ment model to be any quadruple (/C, a, U, M) consisting 
of a Hilbert space /C, a density operator cr on /C, a uni- 
tary operator C/ on W /C, and an observable M on K. 
An apparatus A(x) is said to be described by an indirect 
measurement model {IC,a,U, M), if the measuring pro- 
cess of A(x) admits the above description with the state 
space K, of the probe, the initial state a of the probe, the 
time evolution operator U of the object plus probe dur- 
ing the measuring interaction, and the probe observable 
M. Two indirect measurement models {JC,a,U, M) and 
(/C', <y' ,U' , M') are said to be unitarily equivalent, if there 
is a unitary transformation hi from K, onto K,' such that 
CT = Wa'U, U = {I® W)a'{I ® U), and M = WM'U. 
We shall not distinguish two unitarily equivalent models, 
since they may describe the same physical system. An in- 
direct measurement model (/C, a, U, M) is called pure, if a 
is a pure state; we shall write (/C, a, U, M) = (/C, ^, U, M), 

In an indirect measurement model, the role of the mea- 
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suring interaction is well characterized as a transducer, 
and the subsequent stages as an amplifier. In the Stern- 
Gerlach measurement of the z-component of spin, the 
object system models the spin-degree of freedom of the 
particle, the probe models the orbital-degrees of freedom 
of the particle, and the probe observable corresponds to 
the 2:-component of the linear momentum of the parti- 
cle. Moreover, the amplification process models the free 
orbital-motion plus the interaction with a detector, and 
the output variable corresponds to the z-coordinate of 
the position of the detector that captures the particle. 

B. Output probability distributions 

Let A(x) be an apparatus with indirect measurement 
model (/C, ct, U, M) . Since the outcome of this measure- 
ment is obtained by the measurement of the probe ob- 
servable M at time t + At, by the BSF for observable M 
on input state U{p® a)U^ the output probability distri- 
bution of A(x) is determined by 

Pr{x e A||p} = Tr{[/ O E^(A)]f7(p (g) a)U''}. (56) 

By linearity of operators and the trace, it is easy to 
check that the output probability distribution of A(x) 
satisfies the mixing law of output probability. Thus, by 
Theorem II.2 there exists the POVM U of A(x). To 
determine 11, using the partial trace operation Tr;c over 
/C we rewrite Eq. (56) as 

Pr{x e All/)} = Tr[TrK{U^[I (g) £;^(A)]C/(7 (g) a)}p]. 

(57) 

Since p is arbitrary, comparing Eqs. (11) and (57), 
POVM of A(x) is determined as 

U{A)=TT,c{U^[I(®E'^{A)]UiI(®a)} (58) 

for any Borel set A. 

C. Conditional expectation 

Now we shall introduce a convenient mathematical no- 
tion to deal with such formulas as Eq. (58). 

Let TC and JC be two Hilbert spaces and let cr be a 
density operator on fC. For any C G £(H (g) /C), we define 
the operator £cr{C) G t^{H) by the relation 

£„{C)=TvK.[C{I®a)]. (59) 

The operator £a{C) G ^(H) is called the conditional ex- 
pectation of C in (T. 

The conditional expectations have the following prop- 
erties easily obtained from the properties of partial trace 
operation. 

(i) For C = Y.iAi®Bi€ C{n ® JC), 

£{C) =J2^[BM^i- (60) 



(ii) For any C G C{H O K) and p G Tc{n), 

TT[£,{C)p]=Tr[C{p®a)]. (61) 

(iii) For any A,D G C{n) and B,Eg C{K), 
£„[{A®B)C{D®E)]=A£„[{I®B)C{I®E)]D (62) 

(iv) The transformation C i— > £a{C) is a linear trans- 
formation from C{T-C ® IC) to CCH). 

(v) If C > 0, then £a{C) > 0. 

Mathematically, Eq. (61) shows that the transforma- 
tion C 1-^ £(y{C) is the dual of the trace-preserving 
CP map p ^ p ® a from TciTL) to rc(H (g JC). Thus, 
C £{C) is a unit-preserving CP map from jC{H ® K.) 
to £{n). 

If cr is a vector state such that a = we shall write 

£^{C) = £a{C) and call it the conditional expectation of 
C in ^. In this case, we have 

(V|f£(C)|V) = (V'®€|C|V®0 (63) 

for any ij) gH. Thus, we shall also write 

£^{C) = (CjCjO. (64) 

From Eq. (58), the POVM of apparatus A(x) with 
indirect measurement model (/C, ct, U, M) is the condi- 
tional expectation of the spectral measure of the observ- 
able M{t + At) in the state cr, i.e., 

n(A) = £^[E^^'+^*\A)] (65) 
= £,{U^[I®E''{A)]U}. (66) 

D. Quantum state reductions 

Since the composite system S -|- P is in the state 

U{p (g (j)W at time t + At, it is standard that the ob- 
ject state at time t + At is obtained by tracing out the 
probe part of that state, and, in fact, this rule is justified 
by Postulate IV (Composition rule) in Subsection II A. 
Thus, the nonselective state change is determined by 

p^ p' ^TTK:[U{p®a)U^]. (67) 

In order to determine the quantum state reduction, 
suppose that at time t + At the observer would locally 
measure an arbitrary observable B of the same object 
S. Let A(y) be an apparatus with output variable y 
to make a precise measurement of B . Since both the 
M measurement on P and the B measurement on S at 
time t + At are local, the joint probability distribution 
of their outputs satisfies the joint probability formula for 
the simultaneous measurement of /(gM and B®I in the 
state U{p®a)U'' [54]. 

It follows that the joint output probability distribution 
of A(x) and A(y) is given by 

Pr{xG A,y e A'IIp} 
= Ti{[E^{A')®E^\A)]U{p®a)U^}. (68) 
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Thus, using the partial trace Tr/c we have 

Pr{xe A,y e A'IIp} 
= Tr[E''{A')TrK{[I^E^{A)]U{p^a)U''}]. (69) 

On the other hand, from Eq. (18) the same joint output 
probability distribution can be represented by 

Pr{xe A,y e A'IIp} 

= Tr[i?^(A')p{x6A}]Pr{xG A||p} 

= Tt[i;^(A')Pr{xeA||p}p{,eA}]. (70) 

Since B and A' are chosen arbitrarily, comparing 
Eqs. (69) and (70), we have 

Pr{x e A||p}p{,eA} = Tr;c{[/ ® E^{A)]U{p a)U^ 

(71) 

Prom Eq. (56), the state P{xeA} is uniquely determined 
as 



P{xeA} 



Tr;c{[/ E^{A)]U{p ^ a)W} 
Tr{[I O EM(A)]U{p ® a)m} 



(72) 



for any Borcl set A with Pr{x e A||p} > 0. 

The above formula was obtained in Ref . [32] . It should 
be noted that Eq. (72) does not assume such an illegit- 
imate use of the projection postulate as assuming that 
the composite system S + P with the outcome x e A is 
in the state 



„S+P 



^{xeA} 



[I O E^'{A)]U{p O a)W[I O E^'{A)] 
Tr{[I O E'^(A)]U{p O a)W} 



(73) 



just after the measurement. It is true that the state 
'^{xeA} Is^ds to the same conclusion by defining P{xeA} = 
Tr^[/9|^^|], but such an assumption is by no means cor- 
rect, since for any partition A = A'U A" the state Pj^^j 
should be a mixture of P^^^^,y and P^xeA"} 
not the case for Eq. (73). It is a significant merit of our 
derivation of Eq. (72) to make no assumptions on the 
state of the composite system S + P after the measure- 
ment. 



E. Operational distributions 

In the previous subsection, we have confined our at- 
tention to the case where the measurement using A(x) 
is followed by a precise measurement of an observable. 
Now, we generally suppose that at time t + At the ob- 
server would locally measure the same system S by an 
arbitrary apparatus A(y). We shall show that the joint 
output probability distribution of A(x) and A(y) sat- 
isfies the mixing law. Let Ily be the POVM of A(y). 
Under the condition that the measurement of A(x) leads 
to the outcome x e A, the state at time t + At is p{xeA}- 



It follows from Eq. (71) that the joint output probability 
distribution is given by 

Pr{xe A,ye A'IIp} 
= Pr{yG A'||p|xeA}}Pr{xe A||p} 
= Tr[ny(A')Pr{xeA||p}p{,,,A}] 
= Tr{[Uy{A')^E^{A)]Uip^a)U^. (74) 

By linearity of operators and the trace, it is easy to 
check that the joint output probability distribution of 
A(x) and A(y) satisfies the mixing law. Thus, by The- 
orem 11.3 there exists the operational distribution I of 
A(x). 

By Eq. (71) the operational distribution I is deter- 
mined by 

J(A)p = TVK:{[/®£^(A)]?7(p®a)f/t} (75) 

for any Borel set A and any state p. From the above re- 
lation, it is easy to see that I satisfied the complete pos- 
itivity; as an alternative characterization, it is well-know 
that a linear transformation T on tcCH) is completely 
positive if and only if 



(76) 



for any finite sequences , . . . , ^„ G H and pi , . . . , p„ e 
Tc(T-{) [53] . Thus, we conclude that the operational distri- 
bution of any apparatus with indirect measurement model 
{JC, a, U, M) is a CP instrument. 

Let X be the operational distribution of an appara- 
tus A(x) with indirect measurement model {JC, a, U, M). 
Prom Eq. (75), the nonselective operation T = X(R) is 
represented as 



Tp = TVK:[C/(p®a)f/t]. 



(77) 



For any bounded operator A G CiTi), trace-class opera- 
tor p G Tc{H), and Borel set A G S(R), we have 

Tr[AI{A)p\ 
= Ti{ATifc{[I'»E^HA)]U(p(E)cr)U^) 
= Ti{[A^ E^ iA)]Uip ® a)U^ 
= Tr{U^[A^E^{A)]U{p^a)} 
= Tr{Tr!c{U^[A^E^{A)]U{I^a)}p). (78) 

Thus, from Eqs. (36) and (78) we have 

Tr{[J(A)M]p} 

= TriTTK{U^[A(S) E^ iA)]UiI ® a)}p) 

= Tv{£„{U^[A®E^{A)]U}p). (79) 

Since p is arbitrary, we have 

I{AyA = £^{U^[A^E^{A)]U} (80) 

for any bounded operator A G jO.{'H). In particular, the 
POVM n of A(x) satisfies 



n{A) = £^{U^[I(^ E^ {A)]U} 



(81) 
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for any Borel set A and the nonselective dual operation 
T* satisfies 

T*A = £„{U'<[A(E)I]U} (82) 
for any bounded operator A. 



F. Canonical Measurements 

In this section, we shall consider a model which has 

been considered to describe a typical measuring process 
for an arbitrary observable [8, 55]. Let A(x) be an ap- 
paratus to measure an observable A of the object S de- 
scribed by a Hilbcrt space H. The measuring process 
of A(x) is described by an indirect measurement model 
(/C, (J, U, M) as follows. The probe is modeled by a mass 
of one degree of freedom with position q and momentum 
p, so that the Hilbert space fC is the space of wave 
functions on R, i.e., /C = L^(R). The measuring inter- 
action is turned on in the time interval {t, t -\- At) that 
couples A and p, so that the total Hamiltonian in the 
time interval {t, t + At) is given by 

Hs+-p=Hs<^I + I<^Hy>+KA(^P, (83) 

where Hs and iJp are free Hamiltonians of the object and 
the probe, respectively, and K is the coupling constant. 
We suppose that the coupling is so strong that we can 
neglect the free evolutions and the duration At is so small 
as to satisfy KAt = 1. Thus, the time evolution of the 
composite system S + P during the measuring interaction 
is given by 

U = exp(^^Ai^pj . (84) 

After the measuring interaction, the apparatus makes a 
precise measurement of the position q of the probe to 
output the measurement result. Let ^ be the state vector 
of the probe at the time of the measurement. Then, the 
above apparatus is modeled by the indirect measurement 
model A4{A,S^) defined by 

M{A,0 = {L^R),^,exp{-iA^p/n),q) . 

We shall call this model the canonical model with ob- 
servable A and probe state ^. In what follows, we shall 
denote its operational distribution by and its POVM 
by n^, respectively. 

The Schrodinger equation for the wave function \E'(+r G 
W(g)L^(R) in the time interval t < t+r < t+At becomes 

OT oq 
Now assume the initial condition 

M^t(9)=^(9)V', (86) 



where ip G H is a state vector of the measured system, 
and the vector valued function ^{q)tp represents the ten- 
sor product ip £, in H L^(R). The solution of the 
Schrodinger equation is given by 

*t+r(9) = ^{ql - rKA)i,, (87) 

where I is the identity operator on H. For r = At, we 
have 

*t+At(«)=^(5/-A)V- (88) 

In order to determine the operational distribution of 
this measurement, we first obtain the following useful 
general result. 

Theorem III.l For any unitary operator U on H ® 

L^(R), the indirect measurement model (L^(R), ^, Z7, ^) 
has the operational distribution T determined by 

J(A)|V)(V|= [ \Ui^®0{Q))mi^^O{Q)\dq (89) 

J A 

for any input state tp and Borel set A. 

A formal proof using the Dirac notation runs as follows. 
Let V e W and A e B(R). Then, we have 

T(A)|V)(V| 

= TVk;{[/®£«(A)]|[/(V®0)W®OI} 

= Tr;c{[/® / |9)(9|d9]|C/(^®0)W®OI} 

J A 

= [ Tv,c{I(^\q){q\\U{^P(^0}{U{i^(^0\}dq 

J A 

= [ {q\U{i>®0){U{^^0\q)dq 

J A 

= [ m^®0{q)){u{^®0{q)\dq. 

J A 

Thus, we obtain Eq. (89). 

The statistics of the canonical model A4 {A, ^) is deter- 
mined by the operational distribution Xj. Prom Eq. (88), 

\u{i;<s>0iq)){u{i^<8>0{q)\ 

= \aqI-A)^){aqI-A)^\ 
= ^{qI-A)\^){mqI-Ay 

for any il) G H, and hence from Eq. (89), 

Jj(A)|V')(VI = / aqi - Ammiqi - dq. (90) 

J A 

By linearity and continuity, we obtain 

Ii{A)p= [ i{qI-A)pi{qI-A)Uq. (91) 
J A 

Then, the dual operational distribution is given by 

I^{A)*X = ^{ql - A)^X^{qI - A) dq, (92) 
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where X is an arbitrary bounded operator. The associ- 
ated POM IIj is given by 



n5(A) = ^|e(9/-A)|2rfg. 



(93) 



It follows that the output probability distribution is given 

by 

Pr{qeA||p}= / Tr[|C(g/-A)|V]dg (94) 

for any input state p. From Eq. (91), the output state 
given the output q = (7 is obtained as 



_ ^{qI-A)paqI-A)^ 
P{q=9} 'TT[\^{qI-A)\^p] ■ 



(95) 



If the input state is a vector state p = \tl)){ip\, we also 
have 



Pr{qe AIIV'} 



mqI-A)i;fdq 



(96) 



and the output state is also the vector state P{q=q} = 
l^{q=g})(V'{q=«}l such that 



V'{q=g} 



(97) 



By the function calculus of self-adjoint operator A, we 

have 

/ \^{qI-A)\'dq 

J A 

= [ dE^{X) [ XA{qmq-X)?dq, (98) 

JR JR 

where XA is the characteristic function of the Borel set 
A, i.e., xa{x) = 1 if a; G a and xa{x) = if x ^ A. Let 
f{q) = |^(-5)|2. From Eq. (93), we have 



ni(A) = / (/ 

JR 



*XA)(A)d£^(A), 



where / * XA is the convolution, i.e.. 



(/*Xa)(A) 



/ /(9)Xa(A- 



q) dq. 



(99) 



(100) 



Thus, if the initial state ^ of the probe goes to the position 

cigcnstatc \q = 0), the initial position density function 
|^(g)P and f{q) approaches to the Dirac delta function, 
so that the effect n^(A) approaches to the spectral pro- 
jection £''^(A) of the observable A. Similarly, if ^ goes to 
the position eigenstate, the output state P{ci=q} goes to 
the eigenstate of the observable A corresponding to the 
output of measurement. Thus, the model A4{A,^) de- 
scribes an approximately precise measurement of A that 
leaves the object in an approximate cigcnstatc of A cor- 
responding to the output. For the notion of approximate 
eigenvectors, see Halmos [45]. For the detailed discus- 
sion on the statistical properties of the model A4{A,^), 
we refer to Ref. [55]. 



G. Realizability postulate 

In the preceding section, we have considered the re- 
quirements that every measuring apparatus should sat- 
isfy. However, no postulates were posed as to what 
measuring apparatus exists, except for Postulate II 
(Schrodingcr equation) requiring that for any observable 
there is at least one apparatus to make a precise mea- 
surement of that observable. 

Here, we introduce a postulate that allows to construct 
another measuring apparatus from the apparatus allowed 
by Postulate II (Schrodinger equation). 

Realizability postulate. For any indirect measure- 
ment model (/C, ct, U, M), there is an apparatus A(x) de- 
scribed by (/C, (J, U, M). 

From the above postulate and the statistics of an ap- 
paratus with indirect measurement model, we conclude 
that for any indirect measurement model {IC, (7,U, M) , 
there is an apparatus with the operational distribution X 
such that 

Z(A)p = Tt^{[I ^ E^iA)]Uip ® a)U^ (101) 

for any input state p. 

The above postulate is justified by the assumption that 
our quantum systems obey no supcrselection rules. The 
argument runs as follows. By our assumption, every ob- 
servable A admits a precise measurement, so that we 
can assume that there is at least one plausible model 
of a measuring apparatus for the measurement of A. Al- 
though this is related to a long standing controversy on 
the measurement problem, for the simplicity of the cou- 
pling, the model M{^, A) has been considered be the first 
one to be plausible [6, 8]. Now, we shall argue that the 
realizability of any other indirect measurement model is, 
in principle, as feasible as the realizability of Ai{£,,A). 
From the negation of any nontrivial supcrselection rules, 
any self-adjoint operator corresponds to an observable 
and any density operator corresponds to a state, so that 
we can prepare P in cr and measure M within a given ex- 
perimental error limit. Thus, we have only to show that 
the unitary operator U is realizable. Since any unitary 
operator can be represented by an exponential of some 
observable, we can find an observable A and a parameter 
6 such that U = e~^^^. Then, in order to realize the 
model {JC, cr, U, M), we can follow the following steps: 

(i) Prepare the probe P in the state a at time t. 

(ii) Prepare the model M{A, ^) in the state ^ near the 
momentum eigenstate |^) = \p = 9) at time t. 

(iii) Couple the composite system S -|- P to the model 
M{A,0- 

(iv) At the time t + At, the coupling with the model 
M{A,£) is turned off, and the observer measures the 
probe observable M . 

Now, let p be the input state to the model [JC, a, U, M). 
If the state ^ were such that ^ = |^), the coupling be- 
tween the composite system S+P and the model A4 {A, ^) 
changes the state of the S-\-P from p® a to U {p® a)U\ 
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by the relation 

exp(-iA(g)p)(p(g) fT \e){9\)exp{-iA^p)^ 
= cxp{-i0A){p a) exp(-iM)+ O \e){e\ 
= U{p<^a)U^ <»\9}{0\. 

Thus, the above procedure reahzes the model 
(/C, a, U, M) within the given error hmit, if the state 
preparation is sufficiently near to the eigenstate \'p = 6). 
Thus, any indirect measurement model can be realized, 
in principle, by a physical system under a given unit 
system and in a given experimental error limit. This 
supports the realizability postulate. 

In the preceding subsection, we concluded that the 
operational distribution of any apparatus with indirect 
measurement model {IC,a,U,M) is a CP instrument. 
The converse of this assertion was proven by Ref. [32, 33] 
as follows. 

Theorem III. 2 (Realization Theorem) For any CP 

instrument T there exists a pure indirect measurement 
model (/C, ^, t/, M) satisfying 

I{A)p = Trc{[/®i?^(A)]C/(p®|C)(^|)i7t}(102) 
I{A)*X = S^{U^[X (g)E^{A)]U} (103) 

for any state p and observable X. 

The above theorem has the following two significant 
corollaries. 

Theorem III. 3 For any POVM H there exists a pure 
indirect measurement model (/C, U, M) satisfying 

n(A) = £^{J7^[/ E^{A)]U} (104) 

for any Borel set A. 

Proof runs as follows. Note that given POVM 11 and any 
fixed state po, the relation 

J(A)p = Tr[n(A)p]po. (105) 

defines a CP instnuncnt X(A) with 

I{A)*X=n{A)po{X). (106) 

Then, by the relation n(A) = X(A)*J, the assertion fol- 
lows immediately from Theorem 111.2. 

Theorem III. 4 For any trace-preserving CP map T on 
rc(7i) there exists a pure indirect measurement model 
(/C, ^, U, E) with projection E satisfying 

Tp = TT,c{U{p(^\0{m^}, (107) 
T*X = £^[U''{X(^I)U] (108) 

for any Borel set A. 



This representation was also given by Kraus [40] inde- 
pendently. Proof runs as follows. Note that given trace- 
preserving CP map T and any fixed fixed probability 
measure p,, the relation 

I{A)p = m(A)T(p). (109) 

defines a CP instrument I{A) with 

I{A)*X = p{A)T*{X). (110) 

Then, by the relation T = I(R), the assertion follows 
immediately from Theorem III. 2. 
We summarize the results. 

Theorem III. 5 The operational distribution of any ap- 
paratus with indirect measurement model is a CP instru- 
ment, and conversely every CP instrument is obtained in 
this way with a pure indirect measurement model. 

From the realization theorem, every CP instrument I 
has a pure indirect measurement model {IC,^,U,M). In 
this case, from Eqs. (36) and (78) we have 

= TrmAYAMm 

= Tv{W[A^E'^'{A)]U{\i^){^\ ® 10(^1)} 

= {'tp'»^\U^[A<^E'^{A)]U\tp®^). 

Since tp G Ti. is arbitrary, we have 

I{A)*A = {^\U^[A^E^{A)]U\0. (Ill) 

Let V be the linear transformation from H to K defined 
by 

V'il) = U{'il)®C) (112) 

for all V' € W. Then, we have 

V^V = I. (113) 

Now, we have the following useful representation applied 
to every CP instruments: 

J{AyA = V'^[A(S>E'^{A)\V (114) 

for any A e C(n) and A G B{A). 

Under the realizability postulate, any CP instrument 
has a corresponding apparatus. From the three postu- 
lates discussed above, we conclude that the set of statis- 
tical equivalence classes of apparatuses is in one-to-one 
correspondence with the set of CP instruments. 

We have also another useful conclusion: Any appara- 
tus is statistically equivalent to an apparatus with indirect 
measurement model. Thus, when we discuss statistical 
properties of all the possible measurements, we can as- 
sume without any loss of generality that the apparatus 
under consideration has an indirect measurement model. 
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IV. NOISE IN MEASUREMENTS 
A. Measurements of observables 

Let A(x) be a measuring apparatus with indirect mea- 
surement model (/C, tr, U, M). Let A be an observable of 
the object S. As defined previously, A(x) precisely mea- 
sures A if and only if A(x) satisfies the BSF for observ- 
able A on every input state. 

In order to clarify the meaning of the above definition, 
let us examine the case where observable A has a com- 
plete orthonormal basis of eigenvectors. Then, we can 
write 

A = ^an\an,v){an,y\, (115) 

where a„ varies over all eigenvalues and v is the degener- 
acy parameter. From Eq. (16), it is obvious that if A(x) 
precisely measures A, then A(x) outputs a„ with proba- 
bility one on input state |a„, i^). In what follows, we shall 
show that the converse is also true. Suppose that A(x) 
outputs a„ with probability one on input state |a„, z^) for 
all n and v. Then we have 

Pr{x = a„|| |am,z/)} = (5ot,„, (116) 

so that the POVM H of A(x) satisfies 

{am, z^|n{a„}|a„, v) = Sm,n- (117) 

Consequently, 

^{an}\am, v) = 5m,n\am, I')- (US) 

Hence, we have 

'^{o-n} = ^\an,J^){an,-l^\ (119) 

for all n. Thus, we conclude 

n(A) = £;^(A) (120) 

for all Borel set A, so that apparatus A(x) precisely mea- 
sures observable A. Thus, apparatus A(x) precisely mea- 
sures observable A if and only if it outputs the value of A 
whenever the object has a definite value of A just before 
the measurement. 



B. Noise in direct measurements 

Measurement noise should be defined to be the dif- 
ference between the true value of the quantity to be 
measured and the output from the measuring appara- 
tus. This is meaningful in classical mechanics, but there 
is a diflaculty in quantum mechanics, since we cannot al- 
ways expect that the definite true value exists. However, 
this does not mean that we cannot define the average 



amount of noise of the measurement in a given state. In 
fact, if we can identify the noise with a physical quan- 
tity, we can describe the statistical properties of the noise 
even in quantum mechanics . We shall call this physical 
quantity the noise operator. 

In this subsection, we consider a case where the noise 
operator can be determined easily. We suppose that in 
order to measure an observable ^ in a given state, the 
observer actually make a precise measurement of another 
observable X, the meter observable, in the same state. In 
this case, it is natural to define the noise operator to be 
the observable 

N{A) =X-A. (121) 

Accordingly, the root-mean-square (rms) noise of this 
measurement in the input state ijj should be defined to 
be 

eiA) = (V.|A^(A)2 1^)1/2 = - Afli;}'/^. (122) 

The above formula is easily rewritten as 

e{A) = m)-Am\, (123) 

and hence the rms noise e{A) has properties of distance 
between two vectors A\tp) and X\tp). 

If the observable A has a definite value a in the state 
■tp, i.e., A\tlj) = a\tjj), we have 

e{A) = (^|(X-a)V)V2 (124) 
= (^j^{x-af{^\dE''{xm^ , (125) 

and hence e(A) is the root-mcan-square of the diflference 
between the output x and the true value a. 

Let {A), {X), a'{A), and a(X) be the means and the 
standard deviations of observables A and X, respectively, 
in state ■0. Then, we have a{A) = \\A\^) - {A)\^)\\ and 
so on. From the triangular inequality for the distance 
between vectors, we have 

m)-{x)m\ < \\x\i,)-Am + \\A\,p)-{A)m 
+\\{Am-{x)m (126) 

Thus, the geometric inequality Eq. (126) implies the sta- 
tistical inequality 

<j(X) < e(A)+<j(A) + \{X)~{A)\. (127) 

From an analogous inequalities for vectors, we have 

a{A) < e{A) + a{X) + \{X)^{A)l (128) 
eiA) < a{A)+aiX) + \{X)-{A)\. (129) 

From the above, we have 

\a{X)-a{A)\<eiA) + \{X)-{A)\, (130) 

and hence the increase and decrease of the standard de- 
viation of the output from the standard deviation of the 
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measured observable in the input state is bounded from 
above by the rms noise plus the bias, the difference of 
their means. 

If A has a definite value and the output is unbiased, 
i.e., A\ij) = (X)IV'), from Eqs. (127) and (129) we have 

e(A)^a{X). (131) 

Thus, the rms noise in this case is identical with the 
fluctuation of the meter observable. 

If the output is constant, i.e., X = XqI, from Eq. (129) 
we have 

e{A)<a{A) + \xo-{A)\. (132) 

This inequality already shows that Heisenberg's noise- 
disturbance uncertainty relation does not cover all the 
possible ways of measuring the same observable A. In 
fact, suppose that in order to measure the position ob- 
servable Q in a state ^, the observer actually make a 
precise measurement of a constant observable X = xqI. 
Then, this measurement can be done without disturbing 
any observables, in particular, the momentum P. How- 
ever, the rms noise of this measurement is bounded by 
the finite number a{Q) + \xo — {Q)\ for any state tp with 
IIQIV')!! < CO- Thus, for this measurement the product 
of the root-mean-square noise and the root-mean-square 
disturbance vanishes uniformly over all states ip in the 
domain of the operator Q. 

C. Noise in indirect measurements 

Let A(x) be an apparatus with indirect measurement 
model (/C, a, U, M). We suppose that the apparatus A(x) 
is used for measuring an observable A in the state p at 
time t. In the Heisenberg picture with the original state 
p®a, we write A'" = A® I, = I ® M, A°"* = 
U'^{A(^I)U, and M°"t = W{I®M)U. In this subsection, 
for any observable C of S -I- P, the mean value and the 
standard deviation of C in state p®a\s denoted by (C) 
and (t(C), respectively, i.e., 

{C) = Tr[C(p0a)], (133) 
cj{C) = T,[{C-{C)f{p(^at'\ (134) 

The above definition can be rewritten as 

cj{C) = \\{C- {C))^f^\\HS, (135) 
where || • • • ||ifs is the Hilbert-Schmidt norm defined by 
\\X\\hs = VTrXtX (136) 

for any Hilbert-Schmidt class operator X, i.e., TrX^X < 

oo. Then, a simple application of the Schwarz inequal- 
ity for the inner product {X, Y) = TiX*Y on Hilbert- 
Schmidt class operators, we have 

aiC)aiD) > \Tr[iC-{C))iD-{D))p®a]\ 

> i|Tr([C,D]p®a)| (137) 



for any observables C,D with a{C), (t{D) < oo. We 
shall refer to the last inequality as the Heisenberg- 
Rohertson uncertainty relation for standard deviations or 
Heisenberg-Robertson relation, for short. 

In order to quantify the noise, we introduce the noise 
operator N{A) of A(x) for measuring A. According to 
the measuring process described in Section HI, this mea- 
surement can be described as follows: in order to mea- 
sure the observable A'" in the state p ® a the observer 
actually make a precise measurement of the observable 
M°"* in the same state. It follows that we can apply 
the definition of the noise operator given in the preced- 
ing section. Thus, we define the noise operator N{A) of 
A(x) for measuring A by 

N{A) = M°"*-yl^" (138) 
= U\l<»M)U -At^I. (139) 

The root-mean-square (rms) noise e{A, p), or denoted by 
e{A) for short, of A(x) for measuring A on input state p 
is, then, defined by 

e{A,p) = {N{Af)y\ (140) 

Using the Hilbert-Schmidt norm, the above definition can 

be rewritten as 

e{A, p) = ||M°"*Vp0^ - A'"V^0^||hs. (141) 

We shah write e(A,p) = e{A,ilj), if p = 

In order to clarify the meaning of the above definition, 
suppose that the probe preparation is a pure state a = 
|^)(^| and let us consider the observable A in Eq. (115). 
Suppose that the input state is |a„,:^). Then, we have 

N{A)\an,v)\i) = (M°"* -a„)|a„,i/)|0 (142) 

and 

e{A, |a„, u)) = ((M-* - a^f)^'^. (143) 

Thus, e{A,\an,y)) stands for the root-mean-square dif- 
ference between the experimental output M°"* and the 
true value a„ of observable A. 
If e{A, |a„, y)) = 0, we have 

M°"V„,i/)|0 = a„|an,i^)ie), (144) 

so that A(x) outputs a„ with probability one. Thus, 
we have shown that if e{A, ip) = for any eigenstates 
ip of a purely discrete observable A, then A(x) precisely 
measures A. 

For a general observable A, if the observable A has a 
definite value a in the state p, i.e., Ap = ap, we have 

e{A) = (^ljx-a f{dE^°"\x))y , (145) 

and hence e{A) is the root- mean-square of the difference 
between the output x and the true value a. 
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From the similar argument leading to Eqs. (127)-(129), 

we have 

a(M°"*) < e{A) + cr(A'") + |(M°"*) - (146) 
< e(A)+cr(M°"*) + |(M°"')-(A'")|, (147) 
e{A) < cr(A'") + cr(M™*) + |(M°^')-(A'")|.(148) 

From the above, we also have 

|a(M°"*) - o-(A'")| < e{A) + |(M°"*) - {A'")]. (149) 

D. Distance of POVMs from observables 

In the preceding subsection, we have defined the root- 
mcan-squarc noise of measurement using the associated 
indirect measurement model. Thus, this amount of noise 
apparently depends on the model; for example, two dif- 
ferent models with different boundaries between the ap- 
paratus and the observer describing the physically iden- 
tical apparatus might have different amounts of noise. In 
the next subsection, we shall show that this is only ap- 
parently the case. The root-mean-square noise depends 
only on the POVM of the apparatus and hence statis- 
tically equivalent apparatuses have the same amount of 
noise. In this subsection, we shall generally introduce a 
notion of distance between a POVM and an observable, 
which will play an important role in the study of quan- 
tum noise and disturbance in measurements. 

Let n be a POVM on a Hilbert space H. Let f{x) be 
a real Borel function on R. Denote by / f{x)dU{x), or 
J fdH for short, the symmetric operator defined by 

(^1 J f{x)dU{xM = fix) dmi^M (150) 

for any ^,r] G dom(/ f{x)dll{x)), where the domain is 
defined by 



dom l^J f{x)dU{x) 

= e ^ I ^ /(^)' dmi^M) < ooj . (151) 

The first and the second moment operators ofU, denoted 
by 0(n) and 0^'^\U), are defined by 



0(n) = j xdll{x), 



o(2)(n) 



■dn{x). 



(152) 
(153) 



R 



By the Naimark theorem [56] , there is a Hilbert space 
W, an isometry V : Ti ^ W, and a self-adjoint operator 
C such that 



for every Borel set A. We shall call any triple (W, V, C) 
satisfying Eq. (154) a Naimark extension of 11. By inte- 
grating the both sides of Eq. (154), we have 



o(n) = v^cv, 
o(2)(n) = Ft(7V. 

Since V'<C^V > V^CVV^CV, we have 

o(2)(n) > o{iif. 



(155) 
(156) 



(157) 



Let A and p be an observable and a density operator on 
H. We define the distance dp{n,A) of POVM H from 
observable A in p by 

dp(n,A) 

= Tr{[o(2)(n) - o(n)2 + (o(n) - Af]pY/^ (i58) 
= TV[o(2)(n) - o{n)A - AO{n) + A^)pf'^. (159) 

We shall abbreviate d\^)t^^\ as d^ for a vector state ^. 

In the case where 11 is the spectral measure of an ob- 
servable X, i.e., n = , we have 

0{E^) = jxdE^{x)=X, (160) 
0(^){E^) = [ x'dIl{x)=X'^. (161) 

Consequently, we have 

dp{E'',A) = TT[{X-AfpY'^ 

= \\X^p-A^p\\hs. (162) 

Thus, the distance dp generalizes the distance of two ob- 
servables given by - A^\\hs- 

Now, we have the following properties of the distance 
dp. 

Theorem IV. 1 Let A and p be an observable and a 
density operator on H. For any Naimark extension 
(W, V, C) of a POVM U on H, we have 



dp{n,A) = \\CV^p-VA^p\\Hs- 



(163) 



n(A) = V^E^{^)V 



(154) 



Proof. The assertion follows from Eqs. (155), (156), 

(159), and the relations 

\\CV^p-VA^p\\ls 

= Tr[{CV^ - VA^p)\CV^p - VA^p)] 
= TiliV^C'^V - V'<CVA - AV^CV + A'^)p]. 

QED 

Theorem IV. 2 A POVMH on H is a spectral measure 
of an observable A on TL, i.e, 11 = E^ if and only if 
d^(n, A) — for any state vector tjj gH. 
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Proof. From Eq. (162), if H = E^, wc have dp{n,A) = 
for any p. Conversely, suppose that (11, A) = for 
all state vector ijj <E TL. Let (W, V, C) be a Naimark 
extension of 11. From Theorem IV. 1, we have 

for all V' € W. Thus, we have 

CV = VA, (165) 

and hence CVV^ = VAV''. By taking the adjoint of 
the both sides, we have Since is 

a projection, it follows that all the spectral projections 
E'^'{A) commutes with VV^. Since V is isometry, i.e., 
VW = I, we have 

{A)VV^ {A)V = V^E^{A)V. (166) 

Thus, n(A) = V^E'^'{A)V is projection valued. Prom 
Eq. (165), we have also 

A = V'^CV = j XdU{X). (167) 

By the uniqueness of the spectral decomposition, we con- 
clude that n is the spectral measure of A, i.e., 11 = E^. 
QED 

Corollary IV.3 For any POVM H on H and any ob- 
servable A onH, the following conditions are equivalent. 
(i)U = E^. 

(a) dp(n, ^) = for any state p. 

(Hi) dp(Jl, A) = for a faithful state p. 

(iv) d\ri) (n. A) =Q for any \n) in an orthonormal basis 
{\n)}. 

(v) d^(n, A) = for any state vector ip gH. 

Proof. The implication (i) (ii) is an imnic;diate con- 
sequence of Eq. (162), and the implication (ii) (iii) 
is obvious, since a faithful state exists on any separa- 
ble Hilbert space. To show the implication (iii)=>(iv), 
assume that dp{n,A) = for a faithful state p. Let 
(W, V, C) be a Naimark extension of II. From Theorem 
IV. 1, we have 

{CV - VA)p = 0. (168) 

Let |1),|2),... be an orthonormal basis consisting of 
eigenvectors of p. Then, we have p\n) = Pn\n) with 
Pn > for all n. Thus, applying the both sides of 
Eq. (168) to the vector p~^\n), we have 

{CV -VA)\n) =0. (169) 

By Theorem IV. 1, we have d|„^(n, ^) = for all |n), 
and (iii)=>(iv) has been shown. To show the implication 
(iv)=4>(v), assume that d|„)(n, ^) = for an orthonormal 
basis |1), |2), Prom Theorem IV. 1, we have 

{CV -VA)\n) = 0. (170) 



By linearity, it follows easily that for any state vector 
ip GH, we have 

{CV -VA)\ilj) =0. (171) 

Thus, wc conclude d^{Il, A) = for any state t/j, so that 
(iv)=>(v) has shown. Since the implication (v)=>(i) has 
been proven in the proof of Theorem IV. 2, this completes 
the proof. QED 

Theorem IV. 4 Let C be an observable on Hilbert space 
H ® IC and let a be a density operator on K,. If He is a 
POVM defined by 

Ilc{A) = £4E^{A)] (172) 
for any A e S(R), then we have 

dp{Uc,A) = \\C^/pWa-A(^I^/pWa\\HS■ (173) 

Proof. By integrating the both sides of Eq. (172), we 
have 

0{Uc) = Tr;c[C{I^a)], (174) 
0(2) (He) = TrK[C\l^a)]. (175) 

Thus, by the properties of the partial trace, we have 

Ti[0{nc)Ap] = TT[C{A^I){p^a)], (176) 
Tr[AO{nc)p] = Ti[{A(g)I)C{p(S)a)], (177) 
Tr[0(2)(nc)p] = Tr[C^p^a)]. (178) 

Thus, we have 

yCVpO o- - A (g) IVp^Whs 

= Tv[C'^{p(g)a)]-TT[C{A(g)I){p(g)a)] 

- Tv[{A ® I)C{p <E) a)] - Ti[{A^ O /)(p (g) a)] 
= Tr[(0(2)(nc) - 0{nc)A - AO{Uc) + A^)p\. 

Thus, Eq. (173) follows from Eq. (159). QED 

E. Model independent definition of noise 

The following theorem shows that the root-mcan- 
square noise of an apparatus is determined only by its 
POVM, and hence statistically equivalent apparatuses 
have the same amount of noise. 

Theorem IV. 5 Let A(x) be an apparatus with indirect 
measurement model (/C, a, U, M) . Then, the rms noise 
e{A,p) is determined by the POVM II o/ A(x) as 

e{A,p) = dp{Il,A). (179) 

Proof. The assertion follows from Eq. (141) and The- 
orem IV.4 for C = t/'f(J O M)U. QED 
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Wc define the root-mean-square (rrn,s) noise e{A, p) of 
appa/rafAis A(x) for measuring observable A in state p to 
be the distance dp{U,A) of the POVM H of A(x) from 
observable A in state p. As above, this definition is con- 
sistent with the definition for apparatuses with indirect 
measurement models. 

The following theorem asserts that apparatuses pre- 
cisely measuring A and apparatuses with numerically 
zero rms noise for A are equivalent notions. 

Theorem IV. 6 An apparatus A(x) precisely measures 
an observable A if and only if €{A, p) = on any input 
state p. 

Proof. Let 11 be the POVM of an apparatus A(x). 
Then, A(x) precisely measures A if and only if 11 = E"^. 
Thus, the assertion follows immediately from Theorem 
IV.2. QED 

Let (x) and fT(x) be the mean and the standard devi- 
ation of the output variable x of the apparatus A(x) in 
state p. Then, we have 



In the above, {■ip\A'^\tp) is the theoretical mean value of 
^2 in state V, {i^\0'-^>{n)\tp) is the mean of the squared 
output x^ in state ip, and the other terms are the means 
of the output X in the respective input states. Thus, the 
error €{A, tp) can be statistically estimated, in principle, 
from experimental data of the measurements in states ■i/', 
AV/PV'II, and (^ + J)V/||(^ + /)VII- 



F. Relations to other approaches 

In Refs. [19, 20, 23] the notion of rms noise was previ- 
ously introduced for a restricted class of measurements. 
In what follows, we shall show that those definitions are 
equivalent to the general definition introduced above. 

Let A be an observable of S. A POVM 11 of S is said 
to be compatible with A, or A compatible for short, if it 
satisfies the relation 



[n(Ai),ii;^(A2)] = o 



(190) 



for all Ai,A2 e B{K). 

Let /9 be a state. For an A-compatible POM 11, the 
joint probability distribution of 11 and A in state p is 
defined by 



(x) = / a; Pr{x e rfa;||p}, (180) 
a(x) = (^j {x- (x))2 Pr{x e dx\\p}^ ' ' . (181) '^^Ai x A2) = Tr[n(Ai)i?^(A2)p] (191) 



Prom Eqs. (11), (152), and (153), we have 

(x) = Tr[0(n)p], (182) 
a(x) = (Tr[0(2)(n)p]-Tr[0(n)p]2)'^'. (183) 

From Eqs. (146)-(148), we have 

a(x) < e{A) + a{A) + \{K)-{A)\, (184) 
a{A) < e(A)+a(x) + |(x)-(A)|, (185) 
e{A) < a(A)+a(x) + |(x)-(A)|. (186) 

In particular, we have 

|a(x)-a(^)| < e(^) + Kx)-(^)|. (187) 

In this subsection, we have shown that the rms noise of 
an apparatus is defined independent of a particular model 
to describe the measuring process of the apparatus. This 
suggests that the rms noise can be statistically estimated 
from the experimental data. In fact, this can be done as 
follows. Let n be a POVM and let A be an observable. 
By the relation 

0{U)A + AO{U) = {A + I)0{U){A + I)-AO{U)A 
- 0(n), (188) 

we have 

d^{ii,Af = im'm + iiAo^^Hmi') 

+ (VlO(n)IV') + {Ai;\OmAi,) 
-{{A + I)^\0{U)\{A + I)^). (189) 



for any Ai,A2 € e(R). By Eq. (190) it is easy to see 
that Eq. (191) defines a unique Borel measure on R^. As 
a notational convention, we shall write 



jj^J{x,y)dpf^^\x,y) 



IL 



f{x,y)T:v[dn{x)dE^{y)p] (192) 



for a Borel function f{x,y) on R^. If f{x)g{y) is a 
/Up^''^^(a;, y)-integrable function on R^, then we have 



f{x)g{y)TT[dn{x)dE^{y)p] 



= Tr 



{Jfdu) 



giA)p 



(193) 



Now, let us assume that the POVM 11 of an apparatus 
A(x) is compatible with an observable A. Then, we have 



IL 



{x-yfTr[dIl{x)dE^{y)p] 



-IL 



x'^Tv[dn{x)dE'^{y)p] 



2 jj^^ xyT,[dR{x)dE^{y)p] 

y'^i:i[dIl{x)dE'^{y)p\ 



= Tr[0(2)(n)/?] - 2Tr[0(n)Ap] + Tv[A^ p] 

= Tr{[o(2)(n)-o(n)2 + (x-A)2]p} 

= dp{Il,A). 
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Thus, by Theorem IV. 5, we have 

e{A, pf = (x - yf TT\d^{x)dE^{y)p\. (194) 

The above relation shows that the rms noise e(A, p) rep- 
resents the root-mean-square deviation of the output x 
of the measurement using A(x) from the output y of an 
precise A measurement using another apparatus A(y), 
when these two were made simuhaneously in the state 
p. In Ref. [23], the rms noise of an apparatus with A- 
compatible POVM was introduced by Eq. (194). 

Let us consider the case where the object S is a one- 
dimensional mass and the observable to be measured is 
the position x of the mass. Suppose that the POVM 11 of 
apparatus A(x) to measure x is compatible with x, i.e., 

[n(Ai),i?*(A2)] =0 (195) 

for all Borel sets Ai,A2. Under this condition, there 
is a kernel function G(a, a;) called the resolution kernel, 
which may be a distribution or a generalized function, 
such that 

n(A)= [ da [ G{a,x)dE^{x) (196) 

or 

dU{a)=da[ G{a,x)\x){x\dx (197) 

in the Dirac notation. Even if the apparatus A(x) mea- 
sures position X approximately, the output probability 
distribution Pr{x e (ia||V'} on input state represented by 
a wave function ip^x) is expected to be related to the po- 
sition distribution |V'(a;)P — from Eq. (197), this relation 
is expressed in the following form 

Pv{x G da\\il>} = da [ dxG{a,x)\il>{x)\^. (198) 

Note that G{a,x) is independent of a particular wave 
function tp{x). Obviously, A(x) precisely measures x, 
i.e., 

Pr{x e da\\i)} = \tp{a)\'^ da (199) 

for all tp, if and only if G{a,x) = S{x — a). Roughly 
speaking, G{a, x) is the conditional probability density of 
the output X = a, given that the mass is in the position 
X = X at the time of measurement; hence the rms noise 
e{x, \x)) of the apparatus A(x) on input state |x) should 
satisfy 

e{x,\x)f = [ da{a- xfG{a,x). (200) 

^R 

Since our definition of the rms noise excludes the case 

where the input state is an imnormalizable state like |x), 
Eq. (200) cannot be justified. However, if the input mass 



state is a normalized wave function ipix), the rms noise 
e{x, ip) should satisfy 

e{x,ijf= f eix,\x)f\iP{x)f dx (201) 

JR 

or equivalently 

e{x,i)f = JJ da{a-xfG{a,x)\tlj{x)\'^dx. (202) 

The following computations show that Eq. (202) is actu- 
ally derived from our general definition. For p = ['(/') (V'|, 
we have 

Tr[n(Ai)i;*(A2)p] 

= (v|n(Ai)E*(A2)|^^) 

= [ da [ G{a,x){ij\dE^{x)E^{A2)\il>) 

Jai Jr 

= [ da [ G{a,x){^\dE^{x)\i)). 

JAi J A2 

Thus, by properties of Lebesgue integral, we have 

jj {x- afTr[d\l{a)dE^ {x)p\ 

= jj {x- af daG{a,x)\ip{x)\'^dx. (203) 

Therefore, from Eq. (194) we conclude that Eq. (202) 
actually holds. 



V. DISTURBANCE IN MEASUREMENT 

A. Nondisturbing measurements 

Let A(x) be an apparatus with indirect measurement 
model (JC, cr, U,M). We should generally say that appa- 
ratus A(x) does not disturb an observable B of S, if the 
nonselective state change does not perturb the dynamical 
evolution of the probability distribution of B, i.e., 

Tr{[£;^(A)(g)7]?7(p(g)£7)?7t} 
= Tr[£;^(A)e-'^^*/'^pe'^^*/^] (204) 

for any Borel set A and any input state p, where H is 

the Hamiltonian of the system S. In this paper, we as- 
sume that the apparatus carries out instantaneous mea- 
surements in the sense that the time duration At is very 
small and the coupling between S and P is very large so 
that the free evolution of S in the time interval {t, t + At) 
can be neglected. In this case, we say that apparatus 
A(x) does not change the probability distribution of an 
observable S of S on input state p, if 

Tr{[i;^(A) O I]U{p (S) a)U^ = Tr[£^(A)p], (205) 
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or in the Heisenberg picture, 

(i;^°"(A)) = (^^'"(A)) (206) 

for every Borel set A, where we write B'" = B <S) I 
and = U^B 1^ I)U . We say that apparatus A(x) 
does not disturb observable B, or A(x) is called B- 
nondisturbing, if apparatus A(x) does not disturb the 
probability distribution of observable B on any input 
state p [54]. 

The next theorem shows that nondisturbing measure- 
ments are characterized by nonselective operations, so 
that it is independent of the particular choice of the indi- 
rect measurement model associated with the apparatus. 

Theorem V.l An apparatus A(x) with indirect mea- 
surement model (/C, cr, U, M) does not disturb an observ- 
able B if and only if we have 

T*E^{A) = E^{A) (207) 

for any Borel set A, where T is the nonselective operation 
o/A(x). 

Proof. By the property of the partial trace, we have 

TT{[E^{A)^I]U{p^a)U^ 

= Tr(TrK{C/t[^^(A)® /][/(/ ®a)}p). (208) 

Thus, Eq. (205) is equivalent to 

Tr{Tv^{U^[E''{A) ® I]U{I a)}p) = Tr[E''{A)p]. 

(209) 

Since p is arbitrary, A(x) does not disturb B if and only 
if 

TrK{U^[E^{A) I]U{I ^ a)} = E^{A) (210) 

for any Borel set A. Thus, by Eq. (82) we conclude that 
A(x) docs not disturb B if and only if Eq. (207) holds 
for any Borel set A. QED 

B. Joint measurements with nondisturbing 
apparatuses 

The relation between simultaneous measurements 

and nondisturbing measurements wore investigated in 
Ref. [54] and it was proven that any apparatus A(x) 
precisely measuring an observable A does not disturb ob- 
servable B if and only if successive precise measurements 
of observables A and B, using A(x) for the A measure- 
ment, satisfies the joint probability formula for simulta- 
neous measurements in the first input state. Here, we 
shall generalize the above result for apparatuses which 
do not necessarily make a precise measurement. 

Theorem V.2 Suppose that an apparatus A(y) pre- 
cisely measures an observable B immediately after a 
measurement using an apparatus A(x) with POVM 11^^. 



Then, apparatus A(x) does not disturb observable B if 
and only if their joint output probability distribution sat- 
isfies 

Pr{xG A,yG A'jjp} = Tr[nx(A)i?^(A')p] (211) 

for any input state p and any Borel sets A and A'. In 
this case, 11 is necessarily compatible with B. 

Proof. By the realization theorem, we can assume 
without any loss of generality that the apparatus A(x) 
has a pure indirect measurement model (/C, ^, ?7, M). 

Since the apparatus A(y) precisely measures B, the 
POVM Hy of A(y) is such that Ily = E^ . Thus, from 
Eq. (40), we have 

Pr{x e A,y e A'llp} = TV{J^(A)*[£;^(A')]p}. (212) 

By Eq. (114) we have 

Jx(A)*i;^(A') = yt[£;B(A') ® e'^{A)]V (213) 

for any A, A' e B(IL), where V is such that VV' = U{il>® 
^) for all tp eH. Suppose that apparatus A(x) does not 
disturb observable B. Then, we have 

I^(R)*E^{A') = E^{A'), (214) 

and hence 

E^{A') = V^[E^{A') O I]V. (215) 
Thus, we have 

\VE^{A')-[E^{A')(^I]V\'' = 0. (216) 
Consequently, 

VE^{A') = [E^{A') (3 I]V. (217) 

By Eq. (213), we have 

Jx(A)*£;^(A') = V''[E^{A')<l)E'^{A)]V 
= V^[I^E'^iA)]VE^{A') 
= nx(A)£;^(A'). (218) 

Therefore, Eq. (211) follows. Conversely, suppose that 
Eq. (211) holds for for any input state p and any Borel 
sets A and A'. Let A = R. We have 

Tr{Jx(R)*[S^(A')]p}. = TV[£^(A')p] (219) 
for any state p. Thus, we conclude 

J^{ny[E^{A')] = E^{A') (220) 

for any Borel set A', and the assertion follows from 
Eq. (207). QED 
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C. Disturbance in indirect measurement models 

In order to quantify the disturbance, we introduce the 
disturbance operator D{B) of apparatus A(x) for observ- 
able B defined by 

D{B) = B°>^t-B'" (221) 
= U\B®I)U - B®I. (222) 

The root- mean- square (rms) disturbance ri{B,p) of ob- 
servable B by apparatus A(x) on input state is, then, 
defined by 

ry(B,p) = (D(i?)2)i/2. (223) 

We shaU write ri{B,p) = r]{B,i;) if p = \tp){^\. The 
above definition can be rewritten as 

r^{B,p) = i3'"VP^IIi/S. (224) 

From Eq. (222) we have 

D{B) = U^[B(^I,U]. (225) 

Thus, we have 

r,{B,p) = {\[B^I,U]\Y^', (226) 
and r]{B, p) = if and only if [B (g) /, U]p (g) a = 0. 

D. Model independent definition of disturbance 

In the preceding subsection, we have defined the rms 
disturbance of apparatus using the associated indirect 
measurement model. In what follows, we shall show that 
the rms disturbance is determined by the nonselective 
operation of the apparatus and hence depends only on 
the statistical equivalence class of the apparatus. 

The following theorem shows that the rms disturbance 
of an apparatus determined only by its nonselective op- 
eration. 

Theorem V.3 Let A(x) be an apparatus with indirect 

m,easurement model (/C, cr, C, M). Then, the rms distur- 
bance r]{B, p) is determined by the nonselective operation 
T as 

viB.p) = dp{T*E^,B), (227) 
where T*E^ stands for the POVM defined by 

{T*E^){A) = r*[£;^(A)] (228) 
for any A e B(R). 

Proof. Let H be the POVM defined by 

n(A) = f^{C/t[^^(A) O I]U} (229) 



for any A e B(R). Then, by Theorem IV. 4, we have 
dp{U, B) = \\U^{B I)Uy/^W^ -BO ly/^ 

(230) 

and hence by Eq. (224), we have 

dp01,B)=n{B,p) (231) 
On the other hand, by Eq. (82) we have 

n(A) =T*£;^(A) (232) 

for any A e S(R). Thus, the assertion follows from 
Eq. (231) and Eq. (232). QED 

We generally define the root-mean-square ( rms ) distur- 
bance rj{B, p) of an observable B by any apparatus A(x) 
in state p to be the distance dp{T*E^ , B). As above, this 
definition is consistent with tlie definition for apparatuses 
with indirect measurement models. 

One of the fundamental properties of the rms distur- 
bance is that non-disturbing apparatuses and appara- 
tuses with zero disturbances are equivalent notions, as 
ensured by the following theorem. 

Theorem V.4 The apparatus A(x) does not disturb ob- 
servable B if and only if r}{B,p) = for any state p. 

Proof. From Theorem V.3, rj{B, p) = if and only if 
dp{T*E^,B) = 0. Thus, from Theorem IV.2, r]{B,p) = 
for all p if and only if T*E^ = E^ . By Theorem 
V.l, the last condition holds if and only if A(x) does not 
disturb B. The proof is completed. QED 



VI. NEW FORMULATION OF UNCERTAINTY 
PRINCIPLE 

A. Universally valid uncertainty relation 

Under the general definitions of rms noise and rms dis- 
turbance introduced in the preceding sections, we can 
rigorously investigate the validity of Heisenberg's noise- 
disturbance uncertainty relation. For this purpose, let 
A(x) be an apparatus with indirect measurement model 
{K,,a,U,M). Let A and B be two observables of the 
object. Recall that the noise operator N{A) and the dis- 
turbance operator D{B) satisfy 

M°"* = A'"-^iV(A), (233) 
^out ^ B'^' + DiB). (234) 

Since M and B are observables in different systems, we 
have [M°"*,B°"*] = 0, and hence we obtain the follow- 
ing commutation relation for the noise operator and the 
disturbance operator, 

[NiA).D(B)] + [7V(A),B"^] + [A'^Z)(B)] 

= -[A"\B% (235) 
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Taking the moduli of means in the original state p® cr oi 
the both sides and applying the triangular inequality, we 
have 

\{[N{A),DiB)])\ + \mA),B'-]) + {[A'^,D{B)])\ 

>\Tr{[A,B]p)\. (236) 

Since the variance is not greater than the mean square, 
we have 

e{A,p) > cr{N{A),p(E)<7), (237) 
viB,p) > (T{D{B),p®a), (238) 

and hence by the Heisenberg-Robertson relation, we have 



e{A,pUB,p)>\\{[N{A),D{B)])\. 



(239) 



Thus, we obtain the universally valid noise- disturbance 
uncertainty relation for the pair [A, B), 



e{A,p)rj{B,p) + ^-\{[N{A), B^^]) + {[A^\D{B)])\ 



>-mA,B]p)\. 



The above relation immediately gives rigorous con- 
ditions on what apparatus satisfies Hciscnbcrg's noise- 
disturbance uncertainty relation. Some conditions are 
listed in the following. 

Theorem VI. 1 Let A and B be a pair of observables. 

An apparatus A(x) with indirect measurement model 
{IC,a,U, M) satisfies Heisenberg's noise-disturbance un- 
certainty relation, i.e., 

e{A,p)7j{B,p) > ^\Tt{[A,B]p)\ 

for any state p for which all the relevant terms are finite, 
if one of the following conditions holds: 

(i) The noise operator commutes with and the dis- 
turbance operator commutes with A™ , i.e.. 



[iV(A),B" 
[D{B),J^^ 



(241) 
(242) 



(ii) The noise operator and the disturbance operator 
belong to the probe system, i.e., there are two observables 
N and D onK, such that 



N{A) = I®N, 
D{B) = I®D. 

B. Model-Independent formulation 



(243) 
(244) 



The above characterizations arc easily obtained, but 
depend on the model. In order to obtain intrinsic char- 
acterizations of apparatuses satisfying Heisenberg's re- 
lation, we reformulate the universally valid relation in 
terms of model independent notions. 



Let A(x) be an apparatus with POVM 11 and nonse- 
lective operation T. We now introduce the mean noise 
operator n{A) for observable A and the mean disturbance 
operator d{B) for observable B defined by 



n{A) = 0{U)-A, 
d{B) = T*{B)-B 



(245) 
(246) 



The meaning of the above operators will be clarified in 
the following argument. 

By the realization theorem, there is an indirect mea- 
surement model (/C, a, U, M) such that 

n(A) = f,[£;^°"(A)], (247) 
T*{X) = £^[U\X®I)U] (248) 

for any Borel set A and any observable X on Ti. Then, 
we also have 



(240) Thus, 



and 



o(n) = £:<,(M°"*), 

T*{B) = 5<,(B°"*). 



= o(n) - A, 



£a[D{B)] = £^[B°^'-B'^] 
= T*{B)-B. 



Thus, we have 



n{A) = £„[N{A)], 
d{B) = £a[DiB)]. 



(249) 
(250) 



(251) 



(252) 



(253) 
(254) 



Note that for any observable C on W (g) /C and any 
observable X on H, we have 

£,{[C,X®I]} = [£„{C),X]. (255) 

By the relations, 

Tr{[7V(A),i3-]p0a} = Ti {£,i[N{ A), B"^])p} 
= Tr{[£,{N{A)},B]p), 

we have 

([7V(A),B-])=Tr(KA),iJ]p). (256) 
Similarly, we also have 

{[A'^,D{B)])=Tr{[A,diB)]p). (257) 

Therefore, by substituting Eqs. (256) and (257), we 

obtain the model-independent universally valid noise- 
disturbance uncertainty relation as follows. 
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Theorem VI. 2 Let A and B be a pair of observables. 

Every apparatus A(x) satisfies the relation 

e{A,p)r]iB,p) + ^\Tr{[n{A),B]p} 

+Tr{[A,d{B)]p}\ > ^\Tr{[A,B]p)\ (258) 

for any state p for which all the relevant terms are finite, 
where 11 is the POVM o/ A(x) andT is the nonselective 
operation o/A(x). 

Before stating the conditions for Heisenberg's relation, 

we introduce some terminology. Let A and B be observ- 
ables of the system S to be measured. We say that an 
apparatus A(x) makes an unbiased measurement of A, if 
the mean output is equal to the mean of the observable 
A in the input state, i.e., 

(x) = (A'") (259) 

for any input state p. From Eq. (182), this is the case if 
and only if the first moment operator of 11 is equal to A, 
i.e., 

0(n) = A. (260) 

We say that an apparatus A(x) makes an unbiased dis- 
turbance of B, if A(x) does not change the mean of B, 
i.e., 

(S'") = (261) 

for any input state p. Since the state just after the mea- 
surement is T{p), we have 

=Tr[T*(B)p], (262) 

by the relation Tr[BT(p)] = Tr[T*{B)p\. The above re- 
lation is also obtained from indirect measurement mod- 
els. In fact, if A(x) has an indirect measurement model 
(/C, (7, U, M), then, from Eq. (250) we have 

= TV[B°"*(a«)p)] 
= Tr[£:„(B°"')p] 
= Tr[T*{B)p]. 

Since (i?'") = Tr[i?/3] and p is arbitrary, wc conclude that 
apparatus A(x) makes an unbiased disturbance of B, if 
and only if 

T*{B) = B. (263) 

We say that A(x) has statistically independent noise 
for A, if the mean noise (x) — (A'") does not depend on 
the input state p, or cquivalcntly, if the mean noise oper- 
ator n{A) is a constant operator, i.e., n{A) = rl for some 
r e R. We say that A(x) has statistically independent 
disturbance for B, if the mean disturbance (5°"*) — (B™) 
does not depend on the input state p, or equivalently, if 



the mean disturbance operator d{B) is a constant oper- 
ator, i.e., d{B) = rl for some r e R. 

The model-independent universally valid noise- 
disturbance uncertainty relation leads to rigorous 
conditions on what apparatus satisfies Heisenberg's 
noise-disturbance uncertainty relation, as follows. 

Theorem VI. 3 Let A and B be a pair of observ- 
ables. An apparatus A(x) satisfies Heisenberg's noise- 
disturbance uncertainty relation, i.e., 

e{A,p)n{B,p)>^\TTi[A,B]p)\ 

for any state p for which all the relevant terms are finite, 
if one of the following conditions holds: 

(i) The mean noise operator commutes with B and the 
mean disturbance operator commutes with A, i.e., 

[n{A),B] = 0, (264) 
[d{B),A] = 0. (265) 

(a) The apparatus A(x) has both statistically indepen- 
dent noise for A and statistically independent disturbance 
forB. 

(Hi) The apparatus A(x) makes both unbiased mea- 
surement of A and unbiased disturbance of B. 



C. Generalized noise-disturbance uncertainty 
relation 

In order to obtain the trade-off among the rms 
noise e{A,p), the disturbance r]{B,p), and the pre- 
measurement uncertainties a{A,p) and a{B,p), we ap- 
ply the Heisenberg-Robertson relation to all terms in the 
left-hand-side of the universally valid noise-disturbance 
uncertainty relation. Then, we now obtain the general- 
ized noise-disturbance uncertainty relation as follows. 

Theorem VI. 4 For any apparatus A(x) and observ- 
ables A and B, we have the relation 

e{A, p)r]{B, p) + e{A, p)a{B, p) + a{A, p)r]{B, p) 

>\\T,{[A,B]p)\ (266) 

for any state p for which all the relevant terms are finite. 

Under the finite energy constraint, i.e., a{Q),cy{P) < 
00, the above relation excludes the possibility of having 
both e(Q) = and t]{P) = 0. However, e{Q) = is 
possible with a{Q)ri{P) > fi/2; and also r}{P) = is 
possible with e{Q)a{P) > h/2. In particular, even the 
case where e{Q) = and ri{P) < e with arbitrarily small 
£ is possible for some input state with a{Q) > h/2e, and 
also the case where r]{P) = and e(Q) < e is possible 
for some input state with (t{P) > h/2e. Such extreme 
cases occur in compensation for large uncertainties in the 
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input state, while in the minimum uncertainty state with 
cr(Q) = a{P) = (S,/2)i/2, we have 

e{QUP) + sl\[e{Q)+ri{P)]>\. (267) 

Even in this case, it is allowed to have t{Q)r]{P) = 
with e(Q) = and r]{P) > (/i/2)V2 ^j^j^ ^^p^ ^ q 

and e(Q) > (?i/2)i/'. 

For the general case, we have the following trade-off 
relations for precise A measurements or B-non-disturbing 

measurements. 

Theorem VI. 5 For any apparatus A(x) and observ- 
ables A and B, if A{x) does not disturb B, we have 

e{A,p)a{B,p) > ^\T,{[A, B]p)\ (268) 

for any state p for which all the relevant terms are finite. 

Theorem VI. 6 For any apparatus A(x) and observ- 
ables A and B, if A(x) precisely measures A, we have 

a{A,pUB,p)>^\Tr{[A,B]p)\ (269) 

for any state p for which all the relevant terms are finite. 

For physical significance of the generalized noise- 
disturbance uncertainty relation, we refer the reader to 
Ref. [2, 35]. In the next section, we shall give an indirect 
measurement model that satisfies inequalities in Theo- 
rems VI.4 and VI.6 but does not satisfies Heisenberg's 
relation in Theorem VI. 3 for position measurement noise 
and momentum disturbance. 



D. Uncertainty relations for measurements with 
statistically independent noise 

Let A(x) be an arbitrary apparatus and let A, B be 
a pair of observable of the measured object. Denote by 
I, T, and 11 be its operational distribution, nonselective 
operation, and POVM respectively. Recall that the stan- 
dard deviation of the output x on input state p is given 
by 

a(x,p) = ((x-(x)2))V2 

= (Tr[0(')(n)p] - TV[0(n)p]2)V2. 

From Eqs. (184)-(186), if A(x) makes an unbised mea- 
surement of A, i.e., (x) = (A), we have 

\a{A, p) - e{A, p)\ < a(x, p) < e{A, p) + a{A, p). (270) 

In what follows, wc shall show that if A(x) has statisti- 
cally independent noise or makes an unbiased measure- 
ment of A, the standard deviation 0-(x, p) obeys a recip- 
rocal trade-off with the disturbance on any observable 
B. 



Let (/C, (T, t/, M) be an indirect measurement model 
statistically equivalent to A(x). Now, we shall return 
to the input-output relations, Eqs. (233) and (234), from 
which we have 

[M°"*,B°"*] 

= [M°"\ B'" + D(B)] 

= [M°"*,B'"] + [M°''\D{B)] 

= [^'", B'"] + [N{A), S'"] + [M°"*, D{B)]. 

By the relation [M°"*, = 0, we have 

[N{A), B'"] + [M°"*, D{B)] = -[yl^", B^"]. (271) 

Taking the moduli of the both sides in the original state 
p (g) 0- and applying the triangular inequality as before, 
we have 

|([iV(^),B^"])| + \{[M°-\D{B)])\ > mA,B]p)\. 

By the Heisenberg-Robertson relation and the relation 
a-(M°"*) = £7(x, p), we have 

a{K,pUB,p) > ^\{[M°^\D{B)]}\. (272) 

From Eqs. (256) and (272), we have 

Theorem VI. 7 Let A and B be a pair of observables. 
Every apparatus A(x) satisfies the relation 

a{^,pUB,p) + ^\Tr{[n{A),B]p}\ > ^\Tr{[A, B]p)\ 

(273) 

for any state p for which all the relevant terms are finite, 
where n{A) is the mean noise operator for A. 

From the above, we have the following reciprocal un- 
certainty relation for measurements with statistically in- 
dependent noise and unbiased measurements. 

Theorem VI. 8 Let A and B be a pair of observables. 
An apparatus A(x) satisfies the relation 

ai^,p)rjiB,p)>^\Tri[A,B]p)\ (274) 

for any state p for which all the relevant terms are finite, 
if one of the following conditions holds: 

(i) The mean noise operator commutes with B, i.e., 
[n{A),B] = 0. 

(ii) The apparatus A(x) has an statistically indepen- 
dent noise for A. 

(Hi) The apparatus A(x) makes an unbiased measure- 
ment of A. 
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E. Uncertainty relations for measurements with 
statistically independent disturbance 



for any state p for which all the relevant terms are finite, 
where n{A) is the mean noise operator for A. 



Let A(x) be an arbitrary apparatus and let A, B be 
a pair of observable of the measured object. Denote by 
I, T, and 11 be its operational distribution, nonselec- 
tive operation, and POVM respectively. For any input 
state p, the standard deviation a{B, p) is called the pre- 
measurement uncertainty of B and the standard devia- 
tion a{B,Tp) of B in the state Tp is called the post- 
measurement uncertainty of B. By the definition of the 
root-mcan-square disturbance r]{B,p), they satisfy the 
relation 

\aiB,p)-r]{B,p)\<a{B,Tp)<rj{B,p)+a{B,p). 

(275) 

If the measurement does not disturb an observable B, 
the rms noise e{A, p) is constrained by Eq. (268) so that 



<A,p) > 



m,B])\ 

2a{B,p)- 



(276) 



In what follows, we consider the more general case where 
the statistically independent disturbance or unbiased dis- 
turbance is allowed and we shall show that the rms noise 
e{A) obeys another reciprocal trade-off that is obtained 
by replacing the pre-measurement uncertainty ^{B, p) by 
the post-measurement uncertainty a{B,Tp). 

Let (/C, (J, U, M) be an indirect measurement model 
statistically equivalent to the apparatus A(x). From the 
input-output relations, Eqs. (233) and (234), we have 



= [A'"-h^'(A),B°"*] 

= [A'", B'"'] + [A'", D{B)] + [A^(A), 

By the relation [M°"*, B""^^] = 0, we have 

[N{A), + [A'", D{B)] = -[A'", S'"]. (277) 

Taking the moduli of the both sides in the original state 
p ® u and applying the triangular inequality as before, 
we have 

\{[N{A),B°^'])\ + m-^,D{B)])\ > |Tr([AB]p)|. 

By the Heiscnberg-Robertson relation and the relation 
a(B°"*) = a{B,Tp), we have 



e{A,p)a{B,Tp)> -\{[N{A),B"^'])\. 
From Eqs. (257) and (278), we have 



(278) 



Theorem VI. 9 Let A and B be a pair of observables. 
Every apparatus A(x) satisfies the relation 

e{A,p)a{B,Tp) + ^\Tv{[A,d{B)]p}\ > ^\Tr{[A, B]p)\ 

(279) 



From the above, we have the following reciprocal un- 
certainty relation for measurements with statistically in- 
dependent disturbance or unbiased disturbance. 

Theorem VI. 10 Let A and B be a pair of observables. 
An apparatus A(x) satisfies the relation 



e{A,p)a{B,Tp)>-\TT{[A,B]p)\ 



(280) 



for any state p for which all the relevant terms are finite, 
if one of the following conditions holds: 

(i) The mean disturbance operator commutes with B, 
i.e., [A,d{B)] = 0. 

(a) The apparatus A(x) has an statistically indepen- 
dent disturbance for B. 

(Hi) The apparatus A(x) makes an unbiased distur- 
bance of B. 



VII. THE MODEL BREAKING HEISENBERG'S 
RELATION 

A. Von Neumann's model 

For comparison with the model to be presented later, 
we shall start with a canonical position measurement pro- 
posed by von Neumann [6], which turns out to typically 
satisfy Heisenberg's noise-disturbance uncertainty rela- 
tion. 

Let us consider the case where the object S is a 
one-dimensional mass with position x, momentum px 
{[x,Px] = ih), and Hamiltonian Hg on the Hilbert space 
H = L^(R). Under general definitions given in the pre- 
vious sections, we can rigorously formulate Heisenberg's 
noise-disturbance uncertainty relation as 



(281) 



Let A(q) be the apparatus measuring the system S 
described as follows. The probe P of A(q) is supposed to 

be a one-dimensional system with canonical observables 
q and p {[q,p] = ih), and Hamiltonian Hp on the Hilbert 
space K. = L^(R). The probe observable is designed to 
be the coordinate q of P. The probe is also designed to be 
prepared in a state with a normalized wave function £^{q) 
just before measurement. Mathematically, we assume 
that the wave function is rapidly decreasing, i.e., ^{q) £ 
5(R), so that we have <j{q), a{p) < oo in the state ^. The 
object-probe interaction H is turned on from time t to 
t -h At. The total Hamiltonian for the object plus probe 
is taken to be 



Hs+P =Hs+Hp + KH, 



(282) 
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where K is the coupling constant. We assume that the 
coupling is so strong, i.e., if 1, that the free Hamilto- 
nians can be neglected and that the duration At of the 
coupling is chosen so that KAt = 1. 

Von Neumann [6] introduced the measuring interaction 



H = xp 



(283) 



for an approximate position measurement (see also 
Rcfs. [11, 55, 57]). Then, the unitary operator of the 
time evolution of S + P from t to t + At is given by 



(284) 



U = exp ( 'j^xp 



This measurement is, therefore, described by the indirect 
measurement model 



M(i,0 = (^L'(R),^,exp \^—xp yqj, (285) 

which has been generally introduced in Subsection III F. 
From general results in Subsection IIIF, the model 
M.{x,S,) has the operational measure 



J^(A)p = ^ i{ql - x)p^{ql - x)^ dq, 



the dual operational measure 

Jj(A)*X= / ^{ql-x)^p^{ql-x)dq, 

J A 

the POVM 

nj(A)= / \i{ql - x)\^ dq, 

J A 

the output probability distribution 

Pr{qGA||p}= / Tr[\aql-x)\''p]dq, 

J A 

and the output state 

C(g/-.r)pa'y^--'0^ 



P{q=9} 



Tr[|$(g/-i)|V] 



If the input state is a vector state p = 
have the output probability distribution 



(286) 
(287) 
(288) 
(289) 

(290) 
we also 



Pr{qeA||V}= [ dq [ l^Q - x)\^\^Ij{x)\Ux {291) 
J A Jn 

and the output state 

^{q - x)tp{x) 



V'{q=g}(a;) = 



Jj^{q-x)fmx)fdxj 

with p|q=,| = |V'{q=q})(V'{q=q}|- 



\ 1/2 



(292) 



Solving the Heisenberg equations of motion 
dX{t + T)/dT = -^[X{t + T),KH] 



(293) 



tovt<t + T<t + At, where X{t + T) is any Heisenberg 
observable of S + P, we obtain 



Xit + T) = X{t), 

q{t + T) = KTx{t)+q{t), 

Px{t + T) = Pxit) - KTp{t), 

P{t + T) = p{t). 



For T = At = 1/K, we have 

x{t + At) = x{t), 

q{t + At) = x{t)^ 

p^{t + At) = p.^{t) 

p{t + At) = Pit). 



(294) 
(295) 
(296) 
(297) 



(298) 
(299) 
(300) 
(301) 



It follows that the noise operator, the disturbance oper- 
ator, the mean noise operator, and the mean disturbance 
operator are given by 

N{x) = q{t + At) - x{t) = q{t), (302) 

Dipx) = pAt + At)-p^{t) = -p{t), (303) 

n{x) = m\S)I. (304) 

d{p.) = -{(.\m)I- (305) 

Thus, this measurement has statistically independent 
position-measurement noise and statistically indepen- 
dent momentum disturbance, so that this measurement 
satisfies Heisenberg's noise-disturbance uncertainty re- 
lation. In fact, the moan-square position-measurement 
noise and the mean-square momentum disturbance are 
given by 



e{x)' 

ViPx)'^ 



(306) 
(307) 



Therefore, we conclude that the von Neumann model 
obeys Heisenberg's noise-disturbance uncertainty rela- 
tion. 



e{x)v(Pa:) > 



n 

2' 



(308) 



as a consequence of the Heisenberg-Kennard relation 



o-(a)f^(p) > 



n 



(309) 



applied to the probe state just before measurement. 

This model represents a basic feature of the 7 ray mi- 
croscope on the point that the trade-off between the rms 
noise and the disturbance arises from the fundamental 
physical limitation on preparing the probe. It might be 
expected that such a basic feature is shared by every 
model in a reasonable class of position measurements. 
However, the next model suggests that it is not the case. 



29 



B. Time independent Hamiltonian model 

In what follows, we modify the measuring interaction 

of the von Neumann model to construct a model that 
violates Heisenberg's noise-disturbance uncertainty re- 
lation. In this new model, the object, the probe, the 
probe preparation, and the probe observable to be actu- 
ally measured are the same systems, the same state, and 
the same observable as the von Neumann model. Instead 
of Eq. (283), the measuring interaction is now taken to 
be [19] 



H = 



3\/3 



(2fp - Ip^q + xpx - qp). 



(310) 



The measuring interaction H is turned on from time t to 
t + At. The total Hamiltonian for the object plus probe 
is 



Hg+p = Hg + Hp + KH. 



(311) 



The coupling constant K and the time duration At are 
chosen as before so that K ^ 1 and KAt = 1. Then, the 
time evolution operator U for the time interval {t, t + At) 
is given by 



U = exp 



3v^?i 



{2xp - 2p^q + xpx - qp) 



(312) 



This measurement is, therefore, described by the indirect 
measurement model 



i2(R),e,exp 



— ITT 



{2xp - 2pa;q + xpx - qp) 



We shall call this model the (1, —2, 2) model, whereas the 
von Neumann model will be called the (0, 0, 1) model; for 
general (a,/3, 7) model we refer to Ref. [58]. 

For the time interval t <t + T < t + At, the wave func- 
tion ^t+rix^q) of the composite system S + P satisfies 
the Schrodinger equation 



ih 

The solution is 
'^t+Tix,q) 



d^i+r{x,q) 



dr 



KH%+r{x,q). 



(313) 



, _ , (I — Kt)^ Kttt 



2 f Kttt 
— = < —x sm — - — 
V3 I 3 



g sm ■ 



(1 + Kt) tt 



For T = At = l/K,we have 

^t+At{x,q) = ^t{q,q-x). 



(314) 



(315) 



Now, suppose that at time t, just before the cou- 
pling is turned on, the object wave function is 'ip{x) with 
(j{x), (j{Px) < 00 in the state ^^{x). Since the the probe is 



prepared in the wave function ^{q), the total wave func- 
tion is 



^t{x,q)=^l}{x)i{q). 



(316) 



At time t + At, the end of the interaction, the total wave 
function becomes 



^t+At{x, q) = i){q)(.{q - x). 



(317) 



Compare with Eq. (283); as simple as the von Neumann 
model, but the statistics is much different. 

In the above state, the probe observable q is measured 
to obtain the outcome. Thus the output probability dis- 
tribution of this measurement is given by 

Pr{qGA||V'} = I dq ( |*t+At(x, g)|2 

\i>{q)\''dq. (318) 



The output probability distribution has the probability 
density function |i/'(g)p, which coincides with the Born 

probability density of the object x just before the mea- 
surement and shows that this measurement is precise po- 
sition measurement. 

The object wave function V-'{q=g}(-^) just after this 
measurement given the output q = g is obtained (up 
to normalization) by 



^{q.=q}{x) = 



'^t+At{x,q) 



j^\^t+At{x,q)\^ dx 



1/2 



= C^q-x), 

where C (jCj = 1) is a constant phase factor depending 
only on the output q = g. The above relation can be also 
derived from a general result in Section III. Let /(x) be 
the wave function in Ti, defined by 



f{x)=i{-x) 
for all a; e R. Then, we have 



(319) 



S.{q-x) = f{x-q) 

= [exp{-iqpx/h)f]{x). (320) 



Thus, from 



U{i^(3 0ix,q)=ij{q)^{q-x) 



we have 



U{ip O 0{q) = V'(«) exp{-iqpx/^)f 



(321) 



(322) 
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From Theorem III.l, the operational distribution T sat- 
isfies 

I(A)|^)(^| 

= / \-il^{q)ex.p{-iqp^/h)f) {■ijj{q)ex.p{-iqp^/h)f\ dq 



L 
L 



exp{-iqpa,/n)\f){f\ exp{iqpa,/n)\ip{q)\'^ dq 



= j^exp{-iqp^/h)\f){f\exp{iqp^/h) 

xTV[d£«(9)|V)(V|] (323) 
It follows that the output state given q = g is 

IV'{q=«})(^i'{q=g}l 

= exp{-iqp^/n)\f){f\ex.p{iqp^/h), (324) 
and hence we have 

tp{^=q}{x) = [exp{-iqp^/h)f] (x) = ^{q - x), (325) 

up to constant phase factor. 

By linearity and continuity, from Eq. (323) the opera- 
tional distribution of this model is given by 

2^(A)p = J^exp{-iqp^/h)\f){f\exp{iqp^/n) 

xTr[dE^{q)p] (326) 

Solving the Heisenberg equations of motion for t < 
t + T<t + At, we obtain 

2 , . (l + KT)Tr 2 , . Km 
x{t + T) = -j=x{t)sm --^q{t) sm—^, 

2 , . Kttt 2 , . (1-Kt)tt 
<l{t + T) = --^x{t)sm^- + —q{t)sm , 

2 „ , , . (1 - KT)n 2 , . Kttt 
Px [t + T) = -j=p^ (t) sm '^P^^^ ~3~ ' 

2 , , , . Kttt 2 , . (1 + Kt)^ 
P{t + T-) = -j=Px{t)sm—^ + —p{t)sm . 



For T = At = 1/K, we have 



x{t + At) = x{t) - q{t), 

q{t + At) = x{t), 

p^{t + At) = -pit), 

p{t + At) = pS)+P{t)- 



(327) 
(328) 
(329) 
(330) 



It follows that the noise operator, the disturbance oper- 
ator, the mean noise operator, and the mean disturbance 
operator are given by 

N{x) = q{t + At) - x{t) = 0, (331) 
D{p^) = Px{t + At) - p^{t) = -p- p^, (332) 
n{x) = 0, (333) 
dip.) = -imOI-Px, (334) 

(335) 



Thus, the position-measurement noise and the momen- 
tum disturbance are given by 

eix) = 0, (336) 
Vip.)' = {[p.it)+mf) 

= aip.)'+'7{p)^ + [{Mt)) + {m)f- 



Consequently, we have 



<^{x)'niPx) = 0. 



(337) 



(338) 



Therefore, our model obviously violates Heisenberg's 
noise-disturbance uncertainty relation. 

If {pxit)"^) and (j;(t)2) ^ (i.e., ip and ^ tend 
to the momentum eigenstate with zero momentum) then 
we have even r]ipx{t)) with e(x) = 0. Thus, we can 
precisely measure position without effectively disturbing 
momentum in a near momentum eigenstate. 

Taking advantage of the above model, we can refute the 
argument that the uncertainty principle generally leads 
to a general sensitivity limit, called the standard quan- 
tum limit, for monitoring free-mass position [16, 19]. 



C. Time dependent Hamiltonian model 

The interaction of the preceding model Eq. (310), the 
(1, —2, 2) model, includes the term xpx—qp, which cannot 
be implemented by a simple coupling. Thus, it seems that 
this model is far more difficult than the von Neumann 
model. In this section, we shall show, however, that if we 
use time dependent interaction, the (1, —2, 2) model can 
be implemented as feasibly as the von Neumann model. 

Now, we shall consider the following model description, 
which will turn out statistically equivalent to the model 
discussed in the preceding subsection. In this model, the 
object, the probe, the probe preparation, and the probe 
observable are the same as the previous models. The 
object-probe interaction is turned on from time t to t + 
At. For the time interval t<t + T<t + At, the time 
dependent total Hamiltonian Hs+pit -|- r) of S -|- P is 
taken to be 

Hs+pit + t) = Hs®I + 1 ®Hp- KiiT)px®q 

+ K2iT)xi^p, (339) 

where the strengths of couplings, Kiir) and K^ir), sat- 
isfy 

i^i(r) = ifT^(<,f+^), (340) 
K2{t) = iiT^it+^,t + At), (341) 



KiiT)dT 



t+At 



K2iT)dT = 1. (342) 



31 



Wc assume that A< is so small that the system Hamil- 
tonians Hg and Hp can be neglected from t to t + At. 
Solving the Schrodinger equation, just as von Neumann 
model, the time evolution of S + P during the coupling 
is described by the unitary evolution operators 

U{t+^,t) = exp(^Jp,g^, (343) 

U{t + At,t+^) = exp(^-^xpj. (344) 

Then, in the position basis we have 

{x,y\U{t+ ^,t)\x',y') = {x + y,y\x',y'), 
{x,y\U{t + At,t+^)\x',y') = {x,y - x\x' ,y'}, 
and hence 



{x, y\U{t + At,t+ ^)U{t + ^,t)\x', y') 

= {y,y-x\x',y'). 



(345) 



Thus, by Eq. (321), we conclude that the unitary evolu- 
tion operator 

U = U{t + At,t + ^)U{t+^,t) (346) 

is the same as the imitary operator of the (1,-2,2) 
model. Thus, the above model is identical with the 
(1,-2,2) model. In particular, we have obtained the 
relation 



exp 



-ITT 



{2xp - 2p^q + xpx - qp) 



= exp I -^xp I exp I -p^q 



(347) 



Thus, we can avoid to implement the term xpx — qp, 

and only von Neumann type interactions xp and Pxq are 
sufSce to implement the (1,-2,2) model. 



In what follows, wc consider the rigorous formulation 
of this requirement for general measuring apparatuses. 
Let A(x) be an apparatus with output variable x. In 
order to formalize the repeatability hypothesis, we need 
to consider repeated measurements using the identical 
apparatuses on the same system. Since the same appa- 
ratus cannot be used twice in succession, we assume that 
immediately after the measurement using A(x), another 
statistically equivalent apparatus A(y) with output vari- 
able y is used for the succeeding measurement. Then, the 
repeatability hypothesis states that if x = a; then y = x 
for any x. This condition is well-formulated by the con- 
cept of conditional probability as follows. The apparatus 
A(x) satisfies the repeatability hypothesis if and only if 
the conditional probability distribution of y given x = a; 
satisfies 



Pr{y e A|x = x\\p} = xa{x) 



(348) 



for all X, A and p. Let P{x=x} be the output state given 
x = X for input state p. Then, from Eq. (48), the ap- 
paratus A(x) satisfies the repeatability hypothesis if and 
only if we have 



Pr{y e A||p{x=x}} = Xa{x) 



(349) 



for any Borel set A. 

Now, we shall consider the case where apparatus A(x) 
precisely measures an observable A. Prom Eq. (349), in 
this case A(x) satisfies the repeatability hypothesis if and 
only if we have 



Tr[E^{A)p[^^xy] = xa{x) 



(350) 



for any Borel set A. The last equality is equivalent to 

the condition 



i;^(A)p{x=x}-E"^(A) = XA(a;)p{x=x}- 



(351) 



Suppose that A has purely discrete nondcgencrate 
spectrum ai, a2, . . . with corresponding orthonormal ba- 
sis <^i, (/i2, . . . of eigenvectors. Then the repeatability hy- 
pothesis holds if and only if 



VIII. REPEATABILITY AND UNCERTAINTY 
PRINCIPLE 

A. Repeatability hypothesis and the projection 
postulate 

In formulating the canonical description of the mea- 
surement of an observable, von Neumann required not 
only that the output probability distribution satisfies the 
Born statistical formula but also that the quantum state 
reduction satisfies the following hypothesis abstracted 
from the result of the Compton-Simons experiment [6]. 

Repeatability hypothesis. // an observable is mea- 
sured twice in succession in a system, then we get the 
same value each time. 



P{x=a„} = \(l)n){(pn\ 



(352) 



for all n = 1,2, In this case, the operational distri- 
bution I of A(x) is determined uniquely by 



I{A)p= J2 \<l^n){<Pn\p\4>n){(l>r, 



(353) 



a„eA 



for any p G tc{TL) and A G B{R). Thus for any observ- 
able with purely discrete nondegenerate spectrum the re- 
peatability hypothesis determines an apparatus uniquely 
up to statistical equivalence. 

If A has, however, purely discrete but degenerate spec- 
trum then the repeatability hypothesis no longer deter- 
mines the state after the measurement. In fact, in this 
case P{x=a„} can be one of any eigenstates \'p){4>\ with 
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A(j) = an(j} or even mixtures of them. In order to deter- 
mine the state after the measurement in this case, Liiders 
[59] proposed the following requirement. 

Projection Postulate. For any input state p for a 
precise measurement of a purely discrete observable A, 
the output state P{x=a} is given by 



P{x=a} 



E^{a}pE^{a} 
TT[E^{a}p] 



(354) 



for any a G R with Pr{x = a\\p} > 0. 

According to the projection postulate, if the input 
state is a vector state ^p, i.e., p = \tp){ip\, then the output 
state P{x=a} is represented by the projection E"^{{a})tp 
of V on the eigenspace corresponding to the output a, 



I.e., 



P{x=a} 



\E^{a}i;){E^{a}i;\ 



(355) 



It is obvious that the projection postulate implies the 

rcpeatabiUty hypothesis. The projection postulate yields 
the following operational distribution 

I{A)p = E^{a}p E^{a} (356) 
for all p e TciTL). 



Motivated by the above argument, any DL instrument 
satisfying Eq. (358) for all A, A' e B{K) is said to be re- 
peatable; note that Davies and Lewis [30] called originally 
such DL instruments as "weakly repeatable" . 

Contrary to the fact that there can be many repeat- 
able DL instruments corresponding to the same purely 
discrete observables, the following theorem, conjectured 
in Ref. [30] and proved in Ref. [50, Theorem 5.1] shows 
that there arc no repeatable DL instruments correspond- 
ing to any observables with continuous spectrum. 

Theorem VIII. 1 Every repeatable DL instrument is 
discrete in the sense that there is a countable subset Aq 
o/R such that J(R \ Aq) = 0. 

It is concluded, therefore, that in order to model re- 
peatable measurements of continuous observables it is 
necessary to describe them approximately with arbitrary 
closeness or to extend the formulation of quantum me- 
chanics to include the limit of those approximate models 
[60, 61]. In Ref. [55] it was shown that we have still satis- 
factory models of approximately repeatable measurement 
of continuous observables within arbitrarily small error 
limit in the standard formulation of quantum mechanics. 



C. Approximate repeatability 



B. Discreteness of repeatable instruments 

Now we shall consider the general case where A may 
have a continuous spectrum or even A(x) makes no pre- 
cise measurement of an observable. Let us assume that 
a measurement using A(x) is immediately followed by a 
measurement using another statistically equivalent appa- 
ratus A(y). Let X be the common operational distribu- 
tion of those apparatuses. It follows from Eq. (47) and 
(348) that the repeatability hypothesis holds if and only 
if 

Pr{x e A, y e A'IIp} = Pr{x e A n A'IIp} (357) 

where A, A' G B(R). Thus, from Eqs. (31) and (33) we 
conclude that apparatus A(x) satisfies the repeatability 
hypothesis if and only if the operational distribution I 
satisfies 

Tr[J(A')X(A)p] = Tr[J(A n A')p\ (358) 
or equivalently 

J(A')*2:(A)*/= J(An A')*/ (359) 

for any input state p and A, A' G B(R). The above con- 
ditions are also restated as A(x) satisfies the repeatabil- 
ity hypothesis if and only if the operational distribution 
I and the POVM of A(x) satisfies 

i(A)*n(A') = n(A n A'). (360) 



Whereas von Neumann considers only precise mea- 
surements of observables and introduced the repeatabil- 
ity hypothesis for canonical description of state changes 
caused by measurements, the von Neumann model does 
not satisfy the prcciscness nor the repeatability One 
of the characteristic features of our (1, —2, 2) model is 
that it precisely measures position, but our model does 
not satisfy the repeatability hypothesis cither. Thus, it is 
tempting to understand that the (1, —2, 2) model circimi- 
vent Heisenberg's noise-disturbance uncertainty relation 
by paying the price of failing the repeatability. In what 
follows we shall show that such a view cannot be sup- 
ported. 

In the first place, as discussed in Subsection VIII A, the 
repeatability hypothesis can be satisfied only by measure- 
ments of purely discrete observables. Thus, no precise 
position measurements satisfy the repeatability hypoth- 
esis. 

Secondly, if we consider the approximate repeatabil- 
ity, our model satisfies any stringent requirement of ap- 
proximate repeatability. In order to show this, we need 
the measure of approximate repeatability introduce by 
Ref. [55]. 

Let X be a DL instrument. We define the root-mean- 
square repetition error of I on input state p, denoted by 
R{I,p), as follows. 

R{I,p)=(^jj^{x-yfTT[dI{x)dX{y)p]^ . (361) 
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We shall write R{I,ip) = R{I,\^j){t]j\). Since 
Tr[X(A)X(A')/9] represents the joint probability distribu- 
tion of the ouputs of the repeated measurements of sta- 
tistically equivalent apparatuses with operational distri- 
bution T, the interpretation of the above error is obvious. 
Then we have the following. 

Theorem VIII. 2 A DL instrument I is repeatable if 
and only if I satisfies 

R{I,p) = 
for any density operator p . 

For the proof, we refer to Rcf. [55]. 

A DL instrument I is said to be e-repeatable if I satis- 
fies R{I, p) < sple for any density operator p. Now, it is 
natural to say that an apparatus or an indirect measure- 
ment model is said to be e-repeatable if the corresponding 
operational measure is e-repeatable. 

Suppose that we measure the position of mass x in suc- 
cession using two apparatuses described by the identical 
indirect measurement models with operational distribu- 
tion I. Suppose that the first apparatus with probe q 
interacts with x in {t, t + At) and the second apparatus 
with probe q' interacts with x in {t + At, t + 2 At). Then, 
the root-mean-square repetition error R of the above ap- 
paratus is the root-mean-square difference between the 
first output q{t + At) and the second output q{t + 2 At), 
i.e., 

R^ - {tl^(S)^<E)^\[q'{t + 2At)-q{t + A)f\i^(S)^(S)0- (362) 

If the apparatuses are described by the von Neumann 
model, we have 

q'{t + 2At) = x{t + At) +q'it + At) 
= x{t) + i'{t + At) 
q{t + At) = x{t) + q{t). 

Thus, we have 

[i'{t + 2At) - q{t + At)]'' 

= q'{t + At)^ - 2q'{t + At)q{t) + ^(i)^. 

Since q'{t+At) and q(t) are statistically independent and 
identically distributed we have 

(V O ^ O i\W{t + 2Ai) - q{t + A)f\il} <^^<^0 

= mmf\C)-mm?) 
= Mm?- 

Thus, we have 

R = V2a{q{t)). (363) 

If the apparatuses are described by the (1, —2, 2)model, 
we have 

q'{t + 2At) = x{t + At) =x{t) - q{t), (364) 
q{t + At) = x{t), (365) 



and hence 

q'{t + 2At) - q{t + A) = -q{t). (366) 
Thus, we have 

R={q{t)y^'. (367) 

Thus, for the probe preparation ^ such that {q{t)) = 0, 
the von Neumann model has 

e{x) = am) (368) 
R = V2a{q{t)), (369) 

and the (1, —2, 2)model has 

e{x) = (370) 
R = a{q{t))- (371) 

Thus, for the identical preparation of the probe, the 
(1, —2, 2) model is concluded to be a a{q{t))/\/2- 
repeatable precise position measurement, whereas the 
von Nemnann model is a cr (g(t))-repeatable a{q{t))- 
precise position measurement. 

Therefore, we conclude for any small e > we have an 
e-repcatabk; precise position measiircmcnt that violates 
Heisenberg's noise-disturbance uncertainty relation (1). 
This suggests that how stringent conditions on precise- 
ness and repeatability might be posed for a class of posi- 
tion measurements, we can find in that class at least one 
position measurement that violates Heisenberg's noise- 
disturbance uncertainty relation. 



IX. CONCLUDING REMARKS 

In Ref. [35], we have obtained the universally valid 
noise-disturbance uncertainty relation Eq. (240) and 
the generalized noise-disturbance uncertainty relation 
Eq. (266), and also derived Theorems VI. 1, VI. 5, and 
VI. 6 in the model dependent formulation. However, the 
following problems have been remained open concerning 
the foundations of the model dependent approach. (I) 
Can every measuring apparatus be described by an indi- 
rect measurement model? (II) Are the root-mean-square 
noise and disturbance uniquely determined independent 
of the model? 

Indirect measurement models, originally introduced by 
von Neumann [6] and generally formalized in Ref. [32, 33] , 
are powerful tool to study measuring processes, since the 
interaction between the measured object and the appa- 
ratus is described purely by quantum mechanics. This 
merit is strongly contrasted with a conventional view that 
the measiiring interaction involves the macroscopic part 
of the apparatus. Although some measuring apparatuses, 
especially in the attempts for quantum nondemolition 
measurements [26], allow indirect measurement model 
descriptions, it is still difficult to convince any schools 
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of measurement theory of the affirmative answer to ques- 
tion (I) above. However, the present paper has shown 
that in order to establish uncertainty relations for noise 
and disturbance the use of indirect measurement models 
is justified regardless of the answer to question (I). 

The strategy taken in the present paper is as follows. 
We have started with listing up properties that obviously 
every measuring apparatus obeys, and then proven that 
every apparatus satisfying those properties is statistically 
equivalent to an apparatus described by an indirect mea- 
surement model. In the next step, we have proven that 
the root-mean-squarc noise and disturbance are deter- 
mined by the POVM and the nonselective operation, re- 
spectively, of the apparatus, so that question (II) above 
has been answered affirmatively. This means that if two 
apparatuses are statistically equivalent, they have the 
same root-mean-square noise and disturbance. Thus, if 
a formula for root-mean-square noise and disturbance is 
proven for one apparatus with an indirect measurement 
model, every apparatus statistically equivalent to that 
apparatus obeys the same formula. In this way, we have 
justified the assertion of Ref. [35] that those formulas ob- 
tained for apparatuses with indirect measurement model 
are universally true for every apparatus irrespective of 
the model that describes the apparatus. 

As properties that obviously every measuring appa- 
ratus obeys, we have proposed the following axioms for 
general measuring apparatuses. 

(i) Mixing law: If two apparatuses are applied to a 
single system in succession, the joint probability distri- 
bution of outputs from those two apparatuses depends 
affinely on the input state. 

(ii) Extendability axiom: Every apparatus measuring 
one system can be trivially extended to an apparatus 
measuring a larger system including the original system 
without changing the statistics. 

(iii) Realizability postulate: Every indirect measure- 
ment model corresponds to an apparatus whose measur- 
ing process is described by that model. 

From axioms (i) and (ii), we have demonstrated that 
statistical properties of any apparatus is described by 
a normalized completely positive map valued measure, 
called a CP instrument. Then, it has been shown that 
two apparatus are statistically equivalent if and only if 
they corresponds to the same CP instrument. Thus, the 
set of the statistical equivalence classes of all appara- 
tuses are considered to be a subset of the set of all CP 
instruments. From the realization theorem of CP instru- 
ments (Theorem III. 2) and axiom (iii), we have further 
concluded that the statistical equivalence class of appa- 
ratuses are in one-to-one correspondence with the CP 
instruments. Thus, we can conclude that every appara- 
tus is statistically equivalent to at least one apparatus 
which is described by an indirect measurement model, in 
which the measuring interaction is simply described by 
a quantum mechanical interaction between two quantum 
mechanical systems, the object and the probe. 



There have been many attempts to define the root- 
mean-square noise for some special classes of measure- 
ments. In Section IV we have shown that all those con- 
vincing attempts are equivalent to our notion of the dis- 
tance of a POVM from an observable, based on which we 
define the root-mean-square noise of an arbitrary mea- 
surement in the model independent formulation. The 
empirical adequacy of our definition can be supported by 
the following reasons, (i) Our definition satisfies the re- 
quirement that if the measured observable has a definite 
value in the input state, the root-mean-square noise be 
the root-mean-square of the difference between the true 
value and the measured value (Eq. (145)). (ii) Our defi- 
nition satisfies the requirement that measurements with 
uniformly zero root-mean-square noise coincide with pre- 
cise measurements (Theorem IV. 6). (iii) The difference 
between the standard deviations of the measured observ- 
able and of the measured value is bounded from above by 
the root-mean-square noise plus the bias, namely, the dif- 
ference of their means (Eq. (187)). (iv) The root-mean- 
square noise in any input state can be statistically esti- 
mated from the experimental data (Eq. (189)). (v) The 
root-mean-square noise defined through the noise opera- 
tor has a clear geometric interpretation (Eq. (123)). (iv) 
Even if another observer describes the same apparatus by 
a different indirect measurement model and identify the 
noise operator in a different way, the root-mean-square 
noises for both observers are equal (Eq. (140)). 

In Ref. [2] , we have discussed two distinct types of mea- 
surements in which Heisenberg's noise-disturbance un- 
certainty relation is violated for position measurement 
noise and momentum disturbance uniformly for any in- 
put state. These cases are generalized in Theorem VI. 5 
for type I violation and Theorem VI. 6 for type II viola- 
tion. These relations clearly reveals possibilities of mea- 
surements beyond Heisenberg's relation such as Yuen's 
contractive state measurement [16] and clarifies the new 
constraints for measurements beyond Heisenberg's rela- 
tion. An experimental realization of a measurement with 
type II violation for optical quadrature measurement is 
proposed in Ref. [35]. This measuring interaction is 
equivalent to the (1,-2,2) model, discussed in Section VII, 
which realizes Yuen's contractive state measurement as 
demonstrated in Ref. [19], so that the realization of this 
measurement with required accuracy will open a way to 
the new technology for supersensitive sensors. 
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